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Preface 

In A Mathematical Olympiad Primer, Geoff Smith described the technique of inversion as a ‘dark art’. It is 
difficult to define precisely what is meant by this phrase, although a suitable definition is ‘an advanced technique, 
which can offer considerable advantage in solving certain problems’. These ideas are not usually taught in 
schools, mainstream olympiad textbooks or even IMO training camps. One case example is projective geometry, 
which does not feature in great detail in either Plane Euclidean Geometry or Crossing the Bridge, two of the most 
comprehensive and respected British olympiad geometry books. In this volume, I have attempted to amass an 
arsenal of the more obscure and interesting techniques for problem solving, together with a plethora of problems 
(from various sources, including many of the extant mathematical olympiads) for you to practice these techniques 
in conjunction with your own problem-solving abilities. Indeed, the majority of theorems are left as exercises to 
the reader, with solutions included at the end of each chapter. Each problem should take between 1 and 90 
minutes, depending on the difficulty. 

The book is not exclusively aimed at contestants in mathematical olympiads; it is hoped that anyone sufficiently 
interested would find this an enjoyable and informative read. 

All areas of mathematics are interconnected, so some chapters build on ideas explored in earlier chapters. 
However, in order to make this book intelligible, it was necessary to order them in such a way that no knowledge 
is required of ideas explored in later chapters! Hence, there is what is known as a partial order imposed on the 
book. Subject to this constraint, the material is arranged in such a way that related concepts are as close as 
possible together; this is complemented by a hierarchical division into chapters and sections. 

One concern is that a book of this depth would be too abstract. Wherever possible, both two-dimensional and 
three-dimensional full-colour diagrams are included to aid one’s intuition. 

I have assumed that the reader will have at least the cumulative knowledge contained in both A Mathematical 
Olympiad Primer and a typical A-level mathematics syllabus. I also recommend reading either Plane Euclidean 
Geometry or Crossing the Bridge, although this is not a prerequisite to understanding the content of this book. 

Be fruitful, and multiply. 


Adam P. Goucher, 2012 
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Combinatorics I 

Combinatorics is the study of discrete objects. Combinatorial problems are usually simple to define, but can be 
very difficult to solve. For example, a polyomino is a set of unit squares connected edge-to-edge, such that the 
vertices are positioned at integer coordinates. The four polyominoes with three or fewer squares are shown below: 



A natural question to ask is how many polyominoes there are of size n. We have already proved by exhaustion 
that this sequence begins {1, 1, 2, ...). After a little effort, you will discover that there are five tetrominoes 
(polyominoes of size 4) and twelve pentominoes (polyominoes of size 5). Although this is a very simple problem 
to state, it is very difficult to find a formula for the number of polyominoes of a particular size. Indeed, there is no 
known formula as of the time of writing, and no-one knows how many polyominoes there are of size 60. Even the 

conjectured asymptotic formula, P(n) ~ —, is unproved (it is possible that, for instance, P{n) —^imooT i nstea d). 

Counting polyominoes is a hard problem. Variants of this problem are substantially easier. For instance, suppose 
we restrict ourselves to polyominoes that can be created by stacking cubes in a vertical plane. To make things 
even easier, we consider rotations and reflections to be distinct, so the following arrangements are counted as two 
different polyominoes: 



Many seemingly different combinatorial problems can be shown to be equivalent. This question can be converted 
into an equivalent one by colouring the top cube in each column red, and the remainder green. We then proceed 
up each column in turn, noting the colour of each cube. The configuration below is associated with the string 
GRRGGRR. Every string must end in R for obvious reasons, so we may as well omit the final R and just 
consider the string of n - 1 letters, GRRGGR. 
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Since each of these polyominoes has a unique string, and vice-versa, we have a bijection between the two sets. 
Counting strings of a particular length is very easy (mathematicians would call this trivial ); there are 2" -1 strings 
of n - 1 letters chosen from {G. R\. Hence, there are 2" 1 of these restricted polyominoes. A third way of viewing 
this problem is to consider it to be an ordered partition of «; the above configuration corresponds to the sum 
7 = 2+ 1 + 3-t- 1. So, we have solved a third combinatorial problem: there are 2 n ~ 1 ordered partitions of n identi¬ 
cal objects into non-empty subsets. 

1. How many ordered partitions are there of n into precisely k subsets? 


What if we consider the partitions 2 + 1 + 3 + 1 and 3 + l + l+ 2tobe equivalent? In other words, what if order 
doesn’t matter? This problem can be rephrased by forcing the elements of the partition to be arranged in decreas¬ 
ing order of size, i.e. 3 + 2 + 1 + 1. The associated diagram of this partition is known variably as a Ferrers 
diagram or Young diagram. 


The partition numbers are {1,2,3,5,7,11,...!, as opposed to the ordered partition numbers 

{1, 2, 4, 8, 16, 32, ...j. Whereas the latter have a very simple formula, the formula for the unordered partition 

numbers is given by an extremely complicated infinite series by Hardy, Ramanujan and Rademacher: 


pin) = 


VF 

k =1 




m mod k\ gcd(w,&)=1 


sinh 


Don’t be perturbed by this; the combinatorics explored in this chapter are several orders of magnitude easier than 
the partition problem. We begin with the problem of colouring p beads on a necklace, where p is a prime number. 
This leads to an intuitive proof of Fermat’s little theorem, and a similarly combinatorial approach yields Wilson’s 
theorem. The idea of symmetry is essential, so we contemplate some group theory as well. 
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Burnside’s lemma 



Consider how many ways there are of colouring the 11 beads of this necklace either red or blue. This is an 
ambiguous question and there are many ways in which it can be answered: 

• “There are 2048 ways of colouring the necklace.” 

• “There are 188 ways of colouring the necklace.” 

• “There are 126 ways of colouring the necklace.” 

These answers are all valid, since the question was vague. If rotations and reflections are considered to be distinct, 
then the first answer is clearly correct (as 2 11 = 2048). If rotations are considered to be equivalent, but reflections 
are distinct, then the second is correct. The third answer applies when both rotations and reflections are equivalent. 

It is easy to derive the answer 2048 in the first instance, but the others are somewhat trickier. Probably the best 
way to count the number of possibilities is to use a result known as Burnside’s lemma. Firstly, we define what we 
mean by a symmetry. 


■ A symmetry is an operation we can perform on an object. Moreover, the set of symmetries must form a group under 
composition. For example, a group of rotations can be regarded as symmetries. [Definition of symmetry] 


In the first case of the necklace problem, we only consider the trivial group of one symmetry: the identity. In the 
second instance, we have the cyclic group of eleven symmetries (ten rotations and the identity). Finally, the third 
case requires the dihedral group of twenty-two symmetries (eleven reflections, ten rotations and the identity). 


R V 51 


A direct symmetry can be expressed as a sequence of rigid transformations, such as translations and rotations. For 
example, the red and blue Rs are related by a direct symmetry (rotation by n through their common barycentre). 
By comparison, the green R cannot be obtained from the red R by a sequence of rotations and translations, so is 
related to the red R by an indirect symmetry (in this case, a reflection). The composition of two direct or two 
indirect transformations is a direct transformation; the composition of a direct and indirect transformation is an 
indirect transformation. This idea can be succinctly represented as a 2 x 2 Cayley table : 


o 

D 

I 

D 

D 

I 

I 

I 

D 


■ An object is said to be fixed by a symmetry if it is unchanged by applying that symmetry. [Definition of ‘fixed’] 
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For example, the hyperbola x 2 = y 1 + 1 is fixed by a rotation of it about the origin, whereas the parabola y = x 2 is 
not. 


■ The number of distinct objects is equal to the mean number of objects fixed by each symmetry. [Burnside’s lemma] 


For the second case of the necklace problem, there are 11 symmetries. The identity symmetry fixes all 2048 
objects, whereas the ten rotations only fix two objects (the monochromatic necklaces). So, Burnside’s lemma 
gives us a total of (2048 + 10x2) = 188 unique necklaces. Similarly, for the third case, we observe that there 

must be 2 6 = 64 objects fixed by each of the 11 reflections, so we have ^ (2048 + 10x2 + llx 64) = 126 unique 
necklaces. That this gives an integer answer is a useful way to check your arithmetic. 



The cube has a group of 24 direct symmetries (and the same number of indirect symmetries). We can classify 
those 24 direct symmetries into five conjugacy classes: 

• 1 identity symmetry; 

• 6 rotations by - n about the blue axes; 

• 3 rotations by n about the blue axes; 

• 6 rotations by n about the red axes; 

• 8 rotations by - n about the green axes. 

2. Suppose we colour each face of a cube one of k colours. By considering the number of colourings fixed by 
each of the above symmetries, deduce the number of distinct colourings of the cube where rotations are 
considered equivalent. 


Fermat’s little theorem 
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We now generalise the previous question to a necklace of p beads (where p is prime) and c different colours. 

3. How many distinct ways can a necklace of p beads be coloured with c colours, where p is prime and c > 2? 
Rotations are considered to be equivalent, whereas reflections are distinct. 

4. Hence show that c p = c (mod p). [Fermat’s little theorem] 

Fermat’s little theorem only applies when the modulus is prime. If, instead, the modulus is composite, it is 
necessary to use a generalisation by Euler. Unlike Fermat’s little theorem, Euler’s generalisation does not appear 
to be a consequence of applying Burnside’s lemma to necklaces of n beads. 

■ If a and n are coprime, then a^ = 1 (mod n), where (p(n) is Euler’s totient function (the number of positive integers 
k < n which are coprime to n). [Euler-Fermat] 

Euler’s totient function can easily be computed when the prime factorisation of n is known. Specifically, we have 
the rule ip(a b) = ip(a) ip(b) if a and b are coprime, and ip(p n ) = (p - 1) p n ~ l . 

Suppose N = pq is a product of two distinct primes, each of which has hundreds of digits. Given N, there is no 
known algorithm capable of factorising it to find p and q in a reasonable (polynomial) amount of time. This can 
be used as the basis of a cryptographic system known as RSA (after its creators, Rivest, Shamir and Adleman). 
The idea is that we define a function, / : Z, v -> Z ;V , which the general public has access to. However, we keep the 
inverse function/ _1 secret. 

5. Suppose that b = f(a) = a d (mod N). Show that f~ l (b) = b e (mod N), where d e = 1 (mod <p(N)). [Basis of 
RSA] 

In other words, we publish a, d, N (and therefore /) but leave p, q, e secret. As it is impossible to compute e 
from d without knowledge of p and q, the general public cannot calculate / . Hence, they can encrypt an integer, 
but not decrypt it. As the numbers in can have hundreds of digits, it is possible to store a substantial amount 
of information in one integer. This is typically used to encrypt passwords, safe in the knowledge that there is no 
known algorithm for rapidly factorising semiprimes. 

Interestingly, there is an algorithm called AKS which enables a computer (or, more correctly, Turing machine) to 
determine whether a number is prime in polynomial time (in the number of digits), but actually factorising the 
number may require exponential time. Additionally, so-called ‘quantum computers’ are capable of prime factorisa¬ 
tion in cubic time, so a sufficiently powerful quantum computer would render RSA useless. Fortunately, this 
technology is a long way off, and the largest semiprime factorised by Shor’s algorithm as of the time of writing is 
15 = 5x3 using a machine with seven quantum bits. 
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Wilson’s Theorem 
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Suppose we have a pxp chessboard, where p is prime. We label each square with a coordinate (x, y), where x 
and y are considered modulo p (in effect, forming a toroidal surface). We then place an arrangement of p non¬ 
attacking rooks on the chessboard, i.e. one in every row and one in every column. We consider the group of p 2 
symmetries (one identity and p 2 - 1 translations). 


6. Show that there are p ! arrangements fixed by the identity symmetry. 


7. Show that no arrangements are fixed by any of the 2 (p - 1) horizontal or vertical translations. 

8. Show that p arrangements are fixed by each of the (p - l) 2 remaining translations. 


9. Hence determine the number of unique arrangements, where toroidal translations of the board are 
considered equivalent. 

10. Prove that (p - 1)! = -1 modulo p if p is prime. [Wilson’s theorem] 

If n is composite, then (n - 1)! = 0 modulo n, except where n = 4, in which case (n - 1)! = 2. Hence, the converse 
of Wilson’s theorem is also true. 


Packings, coverings and tilings 


Straddling the boundary between combinatorics and geometry is the idea of tessellations, or tilings. 

Consider a set S of [closed] tiles, each of which is a subset of some region R. If the pairwise intersection of any 
two tiles of S has zero area, then S is a packing. If the union of all tiles in S is the entirety of R, then S' is a cover¬ 
ing. If both of these conditions hold, it is a tiling. 



The diagram above highlights the differences. The first diagram is a packing using two blue circles. The second is 
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a covering using four red circles. The third diagram is both a packing and covering, and thus a tiling, using four 
green isosceles right-angled triangles. 

Using circles of unit radius, there are obviously no tilings of the plane. It is of interest to find the packing of the 
highest density and covering of the lowest density. 



It has been proved that the optimal packings and coverings of the plane using circles of unit radius are obtained by 
positioning them at the vertices of the regular triangular tiling. Other optimisation problems are solved by the 
hexagonal lattice, which is why honeybees favour hexagonal honeycombs as opposed to a rectangular Cartesian 
grid. In higher dimensions, less is known. For three dimensions, the optimal lattice packing of spheres is the face- 
centred cubic lattice A 3 = {(x, y, z) e Z 3 , x + y + z = 0 (mod 2)}, whereas the optimal lattice covering is the body- 
centred cubic lattice A 3 * = {( x , y, z) e Z 3 , x = y = z (mod 2)}. 



Each sphere in the face-centred cubic packing is adjacent to twelve other spheres. This suggests another packing 
problem: what is the maximum number of disjoint unit spheres tangent to a given unit sphere? In two dimensions, 
the answer is rather trivially six. In three dimensions, Isaac Newton conjectured that the maximum is indeed 
twelve spheres, whereas David Gregory hypothesised that thirteen could be achieved. It transpires that Newton 
was correct. The problem has also been solved in 4, 8 and 24 dimensions, again corresponding to the arrange¬ 
ments of spheres in very regular lattice packings (known as Z) 4 , E% and A 24 , respectively). A 24 (the Leech lattice) 
has so many interesting properties and profound connections that I cannot hope to list them all here. Nevertheless, 
its existence is related to string theory, error-correcting codes, the Monster group, and the curious fact that 
l 2 + 2 2 + 3 2 + ... + 24 2 = 70 2 . 

Colouring arguments 


To begin with, we ponder tilings of finite, discrete spaces. For example, consider a standard 8 x 8 chessboard with 
two opposite comers removed. Is it possible to tile the resulting shape with 311x2 dominoes? 
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If the chessboard is coloured as above, each domino must occupy precisely one blue and one red square. As there 
are 32 blue and 30 red squares, it is clearly impossible to tile it with 31 dominoes. 

The more general problem of determining whether a polyomino-shaped region can be tiled with dominoes can be 
embedded in graph theory. We represent the squares with vertices, and join vertices corresponding to adjacent 
squares. Some regions clearly cannot be tiled, even if they have equal quantities of squares of each parity. One 
such example is the following ‘octomino’, shown below with an equivalent bipartite graph: 


The lowest blue vertex in the graph is connected to three red vertices, two of which are exclusively connected to 
this blue vertex. It is therefore impossible to place disjoint dominoes to cover both of the corresponding red 
squares. However, the basic colour-counting argument is insufficient here, as there are four red and four blue 
squares. 

In effect, we want to find a bipartite matching between the red and blue vertices of the graph. A necessary and 
sufficient condition for there to exist an injection from the red vertices to the blue vertices is Hall’s marriage 
theorem. 

■ Let S be the set of red vertices, and T be the set of blue vertices. Consider each subset S'c5, and let T ' c T be the set 
of vertices directly connected to vertices in S’. Then there exists an injection from the red vertices to the blue vertices 
if and only if | 5" | < | 7” | for all subsets S'. [Hall’s marriage theorem] 

For a bijection, it is necessary and sufficient that there are equal numbers of red and blue vertices and the above 
result also holds. Returning to the octomino problem, note that the two red vertices of degree 1 are connected to 
the same blue vertex, so the marriage condition does not hold. 

Verifying the marriage condition can be a time-consuming process, as there are 2" subsets of red vertices for a 
bipartite graph with n red and n blue vertices. This is faster than checking every possible bijection, of which there 
are n !. Both of these algorithms are said to take exponential time. People are interested in fast, polynomial-time 
algorithms, as they usually can be executed in a reasonable amount of time. 

Colouring can solve much more general problems than the domino tiling problem. 

11. Determine whether it is possible to tile a 4x7 rectangle with (rotations of) each of the seven tetrominoes 
(where reflections are considered to be distinct). The seven tetrominoes are shown below: 
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12. Is it possible to tile a 6x6 rectangle with 15 dominoes and 6 non-attacking rooks? [Ed Pegg Jr, 2002] 


13. Show that the maximum number of (grid-aligned) k x k square tiles that can be packed into a m x n 
chessboard is given by |_~J J- 

In addition to determining whether or not a region can be tiled, it is occasionally possible to enumerate precisely 
how many ways in which this can be done. This is typically accomplished using recursion on the size of the 
region. 


14. In how many ways can a 2 x« rectangle be tiled with n dominoes? 


This is a simple case of what one would initially imagine to be a completely intractable problem: to count the 
number of domino tilings of a mxn rectangle. A remarkable discovery by Kasteleyn enumerates this for any 
planar graph, and thus how many domino tilings exist for any polyomino. In particular, a mxn chessboard can be 


n m 

tiled by dominoes in exactly n 11 

k=\ 1=1 


4 cos 2 -h 4 cos 2 

m+1 n+l 


ways. 


Regular solids and tilings 


Suppose we attempt to tile a surface with regular «-gons, where k n -gons meet at each vertex. To avoid trivial 
cases, we assume that both k and n exceed 2. The cases where the Schlafli symbol {«, k) is either [3, 3], {4, 3], 
[3, 4], {5, 3] and |3, 5] result in the five regular solids, namely the tetrahedron, cube, octahedron, dodecahedron 
and icosahedron. 

A e • ® 

They are also referred to as Platonic solids, as Plato believed that all matter was composed (at the atomic level) of 
minuscule cubes, tetrahedra, octahedra and icosahedra, associating each one with a different classical element. He 
reserved the dodecahedron for representing the entire universe. 

15. Each face of a regular dodecahedron is infected with either E. coli, S. aureus or T. rychlik bacteria. In how 
many ways is this possible, treating rotations as equivalent? [Adapted from Google Labs Aptitude Test] 


If {«, k) is [6, 3], {4, 4] or {3, 6], we obtain the hexagonal, square and triangular tilings, respectively, of the plane. 
The Platonic solids can be regarded as analogous tilings of the sphere. 
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If {«, k) is anything other than these eight possibilities, the sum of the angles around each vertex exceeds 2 n. This 
is only possible in the bizarre hyperbolic surfaces described by Bolyai-Lobachevskian geometry. 



On the complex plane, numbers of the form a + bi (a, b e Z) form a ring known as the Gaussian integers, which 
are positioned at the vertices of the square tiling. As Euclid’s algorithm can be applied to the Gaussian integers, 
the fundamental theorem of arithmetic still holds: Gaussian integers can be factorised uniquely into a product of 
Gaussian primes (up to multiplication by the units , 1,-1 ,i and -i). Not all ordinary primes are Gaussian primes; 
for example, 2 is not a Gaussian prime, as it can be factorised as (1 + i) (1 - i). 

Suppose we have a grasshopper initially positioned at the origin, which can only jump to a Gaussian prime within 
the disc of radius R centred on its current position. It is an unsolved problem as to whether there is some R for 
which the grasshopper can visit infinitely many Gaussian primes. 


• • • 3 

. • . 2 

• • • 1 

• • • 3 

• • . 2 

. i 

. 

-3 -2 -1 

• • . -1 

• . • -2 

• • • -3 

1 2 3 -3 -2 -1 

. . . * * -1 

• • • * * * -2 

• • . -3 

1 2 3 


Similarly, numbers of the form a + b a> (a, b e Z), where w is a primitive cube root of unity, form the ring of 
Eisenstein integers. They are positioned at the vertices of the triangular tiling. As with the Gaussian integers, the 
fundamental theorem of arithmetic applies. The units are the sixth roots of unity, namely {± 1, ±cj, ±ar}. It is 
possible to find the squared distance between two Eisenstein integers a and b by expressing the vector a - b in 
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terms of {l, cj, orj and calculating | a - b | 2 = (a - b) ( a * - b*), remembering that 1 + cj + or = 0 and w 3 = 1. 

16 . A set 5 of 99 points are drawn in the plane, such that no two are within a distance of 2 units. Prove that 
there exists some subset T c S of 15 points, such that no two are within a distance of yfl~ units. 


Aperiodic tilings 


As we noted, the only regular polygons capable of tiling the Euclidean plane are the triangle, square and hexagon. 

9 

Pentagons cannot, as three pentagons at each vertex have an interior angle sum of - n, which is slightly less than 

2 n and causes the pentagons to ‘curl up' into a dodecahedron. Similarly, attempting to place four or more pen- 

12 

tagons around each vertex results in a hyperbolic tiling, as — n>2n. 

More strongly, there is no tiling of the plane which exhibits both translational symmetry and order-5 rotational 
symmetry. To prove this, we assume without loss of generality that the tiling is fixed by both a translation parallel 


-w 


to the vector | n | and a rotation by - it about the origin. In that case, it is possible to map the origin to any point 
expressible as the sum of fifth roots of unity. 


-3 


The points on the real axis expressible in this way are those of the form a + b(p, where a, b e Z and 
<f> = ^ (1 + V5"). As 0 is an irrational number, these points form a dense subset of the reals, i.e. for every e > 0, 

every point x on the real axis is within a distance of e from a point of the form a + b<f>. This means that the tiling 
must be composed of infinitesimally small tiles, which contradicts our notion of discrete tiles. 
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If we dispose of the translational symmetry, we can indeed have tilings with order-5 rotational symmetry. Perhaps 
the most famous is an aperiodic tiling known as the Penrose tiling (above), formed from interlocking ‘thin’ and 
‘thick’ rhombi in the ratio 1 : (f>. It is a remarkable fact that every tiling of the plane with these two tiles (and 
certain matching rules) exhibits this ratio, and is thus aperiodic (since <p is irrational). An unsolved problem is 
whether there is a single connected shape (an ‘aperiodic monotile’), which can only tile the plane aperiodically. 
Joshua Socolar and Joan Taylor recently (2010) discovered a disconnected aperiodic monotile based on the 
hexagonal honeycomb, suggesting that there may indeed be a connected variant waiting to be found. 

There is a three-dimensional analogue of the Penrose tiling. It is formed from equilateral parallelepipeds (three- 
dimensional rhombi) and displays icosahedral symmetry. Crystallographers were very surprised to find naturally 
occurring crystals with this structure, termed ‘quasicrystals’. It was previously believed that solids could only be 
either periodic crystals or totally irregular. 


Invariants 


An invariant is, as suggested by the name, something that doesn’t change. One of the simplest invariants is parity: 
whether something is even or odd. Integers are one of the most common things to display parity; however, the 
idea is equally applicable to other things such as permutations. To realise that permutations have a parity, it is 
necessary to consider them in a more geometrical light. 

A 

An n-simplex is a regular /z-dimensional figure (polytope) with n + 1 vertices, which is fixed under any permuta¬ 
tion of the vertices. The 1-simplex, 2-simplex and 3-simplex are the line segment, triangle and tetrahedron, 
respectively, as in the above diagram. Interchanging two of the vertices of a simplex can be regarded as a reflec¬ 
tion. For example, reflecting a regular tetrahedron ABCD with circumcentre O in the plane O CD causes the 
vertices A and B to be swapped. 
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This suggests two different sets of permutations: the odd permutations, which correspond to indirect isometries of 
R"; and even permutations, which correspond to direct isometries. A k- cycle (cyclic permutation of some subset 
containing k elements) is an odd permutation if k is even, and vice-versa. In particular, 2-cycles (or swaps) are 
odd permutations. 

The set of even permutations of n elements forms a group known as the alternating group A n . This is a subgroup 
of the group of all permutations, known as the symmetric group S„. Any composition of even permutations is 
itself an even permutation, which can form a useful invariant. For example, it shows that not all conceivable 
configurations of a Rubik’s cube can be attained by applying legal moves to the initial ‘solved’ position. 

17. Suppose we have a hollow 4x4 square containing 15 unit square tiles and one empty space, into which any 
adjacent tile can be moved. The fifteen tiles are numbered from 1 to 15. Determine whether it is possible to 
get from the left-hand configuration to the right-hand configuration in the diagram below. [Sam Loyd’s 15 
puzzle] 


1 2 3 4 1 2 3 4 

5678 Q 5678 

9 10 11 12 • 9 10 11 12 

13 14 15 13 15 14 


Instead of an invariant, it is possible to define a value that only changes in one direction, known as a monovariant. 
This is useful for proving that a process (such as a perturbation argument) eventually terminates. 

18. There are n red points and n blue points in the plane, no three of which are collinear. Prove that it is 
possible to pair each red point with a distinct blue point using n non-intersecting line segments. [EGMO 
2012, Friday bulletin] 


Solitaire 


Quite a few interesting problems pertain to the game of peg solitaire. We have a (possibly infinite) board, which is 
a subset of Z 2 containing some (possibly infinite) initial configuration of identical counters. The only allowed 
move is to jump horizontally or vertically over an occupied square to an unoccupied one; the piece that has been 
jumped over is removed. This is demonstrated below. 


19. Suppose we have a game of solitaire on a bounded board beginning with the configuration of 32 pieces 
shown below. Show that if we can reach a position where only one piece remains on the board, then we can 
do so where the piece is in the centre. 
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20. We begin with an infinite chessboard, and divide the board into two half-planes with a straight horizontal 
line. All squares below the line are occupied with counters; all squares above the line are unoccupied. Show 
that it is impossible, after a finite sequence of moves, for a counter to occupy the fifth row above the line. 

[Conway’s soldiers] 

21. Suppose we have an infinite chessboard with an initial configuration of n 2 pieces occupying n 2 squares that 
form a square of side length n. For what positive integers n can the game end with only one piece remaining 
on the board? [IMO 1993, Question 3] 
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Solutions 

1. We are enumerating strings containing precisely k - 1 Rs and n-k Gs. Hence, the number of ordered 

( it — 1 j (tt-iv 

partitions of n into k subsets is given by the binomial coefficient I I = ^ . 

2. All k 6 colourings of the cube are fixed by the identity. Consider a rotation by ^ n about the vertical blue 

axis. The top and bottom faces can be any colour, whereas the four other faces must all be the same colour. 
Hence, each of the 6 symmetries in this conjugacy class fix Ac 3 colourings. By similar reasoning, the 3 
rotations by n about the blue axes each fix k 4 colourings. The 6 rotations about the red axes each fix A : 3 
colourings, whereas the 8 rotations by | n about the green axes fix only k colourings. Applying Burnside’s 

lemma, the total number is ^ (Ac 6 + 3 A : 4 + 12 Ac 3 + 8 Ac 2 ). 

3. There are p symmetries, namely the identity and p — 1 rotations. The former fixes all n p colourings, 
whereas the latter fixes only the n monochromatic necklaces. Hence, we have - (n p + n(p - 1)) unique 
necklaces. 

4. The result of the previous question is an integer, so c p + c(p - 1) is divisible by p. Hence, c p + c p - c = 0. 
As c p = 0, this means that c p = c (mod p). 

5. Note that <p(N) = <p(p) <p(q) = (p - 1) (q - 1). Expressing b in terms of a, we obtain b e = a de . As 

a‘ f{N] = i ( moc j y\t) by Euler-Fermat, and d e = 1 (mod ip(N)), a de =a l = a (mod N), so is precisely the inverse 
function we are looking for. 

6 . The position of the rooks can be regarded as a bijection mapping rows to columns. There are p ! 
permutations of p elements. 

7. Without loss of generality, just consider horizontal translations by (a, 0). If there is a rook in (x, y), there 
must also be a rook in (x + a, y), contradicting the assumption that the rooks are non-attacking. 

8 . Consider the rook positioned at the coordinates (x, 0), and let the translation be parallel to vector ( a, b). 

This forces there to be rooks in positions (x + a, b), (x + 2 a, 2 b), ..., (x - a, -b). Hence, the arrangement is 
determined uniquely by the abscissa of the rook in the 0th row, of which there are p possibilities. Hence, p 
arrangements are fixed by each of these translations. 

9. We have — 1 [p\ + p(p ~ l) 2 ) distinct arrangements by Burnside’s lemma. 

10. The previous answer must be an integer, so p ! + p(p - l ) 2 = 0 (mod p 2 j. Dividing throughout by p, we 
obtain (p - 1)! + (p - l ) 2 = 0 (mod p). We can expand this to yield (p - 1)! + p 2 - 2 p + 1 = 0 (mod p). As 
p 2 and 2 p are divisible by p, we can eliminate those terms, resulting in the statement of Wilson’s theorem. 

11. Colour the squares black and white, as on a standard chessboard. The T-shaped tetromino must cover three 
black squares and one white square (or vice-versa), whereas each of the other tetrominoes cover precisely 
two squares of each colour. As the chessboard features equal numbers of black and white squares, this is 
indeed impossible. 

12. Colour the squares black and white, as on a standard chessboard. The six rooks are positioned on squares 
( i, cr(i)), where <r is a permutation of {1, 2, 3, 4, 5, 6 ). Select two rooks at positions ( i, cr(i)) and (j, cr(j)), 
and move them to (i, cr(j)) and (j, cr(i)), respectively. Applying this move does not alter the parity of rooks 
on white squares. Since we can do this until they lie on the long diagonal of white squares, it is clear that 
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there must have been an even number of rooks on white squares to begin with. However, the constraint that 
the remaining 30 squares can be tiled by dominoes forces the rooks to occupy three white and three black 
squares, which contradicts the previous statement. Hence, it is impossible. 


13. Represent each square with coordinates ( x , y), where re|l, 2, ..., m] and y e 11, 2, ..., n). Colour the 
square blue if x = y = 0 (mod k), and white otherwise. Clearly, each tile must conceal precisely one blue 
square, and there are only [~ J [~ J of them. This bound is attainable. 

14. Let this number be denoted /(«). Either the rightmost 2 x 1 rectangle is a (vertical) domino or the rightmost 
2x2 rectangle is a pair of horizontal dominoes. Now consider how many ways there are of tiling the 
remaining area. In the first case, there are f(n - 1) possible configurations; in the second, there are f(n - 2). 
This gives us the recurrence relation f(n) = f(n - 1) + f(n - 2). Together with the obvious fact that 

/(1) = 1 and /(2) = 2, this generates the Fibonacci sequence, f(n) = F(n +1). 

15. There are 60 symmetries of the regular dodecahedron. The identity symmetry fixes all 3 12 infections. There 
are 24 rotations about axes passing through the centres of opposite faces, each of which fix 3 4 infections. 
The 15 rotations about axes passing through the midpoints of edges each fix 3 6 infections. Finally, the 20 
rotations about axes passing through opposite vertices each fix 3 4 infections. By Burnside’s lemma, there 
are (3 12 + 24x 3 4 + 15 x 3 6 + 20x 3 4 ) = 9099 unique infections of the dodecahedron with three strains of 
bacteria. 

16. Tile the plane with the regular hexagonal tiling, where each hexagon has side length 1. Clearly, no two 
points in S can occupy the same hexagon. 7-colour the hexagons in a repetitive fashion, such that each 
hexagon is adjacent to six hexagons of different colours. By the pigeonhole principle, at least 15 of the 
points must lie in identically-coloured hexagons. It is straightforward to show that no two of those points 

can be within of each other, by considering the closest approach of the vertices of the hexagons and 
using cube roots of unity to calculate the distance: the arrow shown in the honeycomb below has a complex 
vector of 2 - to, which has squared length (2 - to) (2- nr) = 4 - 2 (go + w 2 ) +1=7. 
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17. Label the empty space with 0, so we can regard this as a permutation of {0, 1, ..., 15}. Consider the parity of 
x + y, where (x, y) is the location of the empty space, together with the parity of the permutation <x. Note 
that each move flips both parities, thus leaving the total parity of x + y + tr unchanged. However, 
interchanging any two tiles without moving the empty space alters the parity of x + y + cr, so it is impossible 
to get from the left configuration to the right configuration. 



18. Biject them in an arbitrary way using n line segments. If we encounter a configuration of four points joined 
by two intersecting line segments, as above, then we can replace the line segments with disjoint line 
segments. Let the monovariant E be the total length of line segments. E strictly decreases at each step (by 
the triangle inequality), so the process cannot cycle. As there are only finitely many bijections between red 
and blue points, the process must terminate with n disjoint line segments. 
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19. Firstly, colour the tile at coordinates ( x , y) either red, green or yellow depending on the value of x + y 

modulo 3, where we consider the central tile to be the origin (coloured red). As the parities of red, green and 
yellow counters all change simultaneously when a solitaire move is played, the final counter must be on a 
red square. However, we can also colour the tiles depending on x - y modulo 3, resulting in a perpendicular 
pattern of colouring as shown above. The only tiles that are red in both colourings are given by (3 i, 3 j), 
where i and j are integers. On the bounded board, there are only five such tiles. Backtracking by one move 
must result in a configuration equivalent to the one shown below, in which case we can trivially jump to the 
central square. 



20. Assume that it is possible to reach a square in the fifth row, in attempt to derive a contradiction. Without 
loss of generality, we will use T = (0, 0) as the ‘target square’. For each square (x, y), we assign a value of 

(]r ( \ x \ + \y\\ where | x | + | y | is the Manhattan distance between (x, y) and (0, 0), and <p = l+ ^ 5 is the golden 

ratio. Let E be the sum of the values of the occupied squares. If a counter on a square of value tj> k jumps 
over one of value cp k+1 , this results in a single counter on a square of value less than or equal to <f> kJrl . As 
<p k+2 = (j> k+1 + <p k , the value of E cannot increase. At the beginning of the game, the value of E can be 
calculated by summing some geometric progressions; it is simple to show that this value equals 1. As the 
value of the target square is also 1, it is necessary to use all of the counters to reach it. However, that is 
impossible in a finite amount of time, as there are infinitely many counters. 

o o o o 

ooo — ooo — ooo — ooo 

o o o o 

21. For n = 1 and n = 2, this is trivial. If we have an arrangement shown above, it is possible to ‘delete’ three 
adjacent pieces. This can be used, rather effectively, to reduce a problem from « = 3 k + 4 to n = 3 k + 2 by 
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deleting the outermost ‘layer’ of pieces, as in the diagram below. Similarly, we can reduce a problem from 
n = 3 k + 5 to /; = 3 k + 1 by deleting the outermost two layers. By induction, we can solve the problem for 
all n except for multiples of three. If n is a multiple of three, we colour the tile at coordinates (x, y) either 
red, green or yellow depending on the value of x + y modulo 3. Let the number of pieces on red, green and 
yellow tiles be indicated by R, G and Y , respectively. Note that if (-1)^ = (-l) c = (-l) r before a solitaire 
move, then it will remain true afterwards. This condition is clearly true for a 3 kx3 k square of pieces, but 
false for a single piece. Hence, we cannot reduce the arrangement to a single piece if n is a multiple of 3. 
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Linear algebra 

The second chapter of this book is concerned with vectors, matrices and linear transformations. Determinants are 
introduced, together with ways in which to calculate them. These concepts are particularly relevant in analytic 
geometry, where we use them to describe projective transformations. 


Linear transformations 


Linear transformations are transformations of n-dimensional Euclidean space R" expressible asi-*Mr, where 
x = (x i, X 2 , ..., x n ) is the position vector of a point X. M is known as the transformation matrix. For example, the 

(l l\ 

linear transformation with matrix I ^ I is shown below. 




The position of the origin, O , is left unchanged by a linear transformation. Degree-c/ algebraic curves remain as 
degree-c/ algebraic curves; in particular, lines map to lines and conics map to conics. In the shear shown above, a 
circle is transformed into an ellipse. Parallel lines remain parallel when linear transformations are applied. 
Finally, the (signed) area of any shape is multiplied by det(M) when the transformation is applied, where det(M) 
is the determinant of the transformation matrix. Hence, ratios of areas remain unchanged. 

Common linear transformations include rotations (about the origin), reflections (in lines through the origin), 
dilations (where the origin is the centre of homothety) and stretches (again, preserving the origin). One can 
combine transformations by multiplying their matrices. 


1. Let A = (1, 0, 0), B = (0, 1,0) and C = (0, 0, 1) be three points in R 3 . After applying the transformation with 
a b c 


matrix M = 


d e f 
8 h i 


find the new locations of A, B and C. 


Consider the unit cube [0, l]x[0, 1 ] x[0, 1], where x denotes Cartesian product. It is transformed into a paral¬ 
lelepiped with volume V = det(Af). 



In the diagram above, the blue cube is transformed into the red parallelepiped. The origin (the common vertex of 
the cube and parallelepiped) remains fixed. 
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Determinants 


The determinant of a square matrix M is a positive real number det( M) associated with that matrix. It behaves 
like the norm of a complex number, in that it is multiplicative. 

■ For two square matrices A and B of equal dimension, det(A B) = det(A) det(B). [Multiplicativity of determinants] 

If a matrix A has an inverse matrix A -1 such that A A -1 = A ' 1 A = 7, then det(A) del (A = det(7) = 1. Hence, it is 
clear that a matrix with a determinant of zero has no inverse. Indeed, the converse is also true: all square matrices 
with non-zero determinants possess unique well-defined inverses. If a matrix is one-dimensional, then its 
determinant is equal to its only element. Otherwise, we compute it recursively. 


' a 1,1 

01,2 ' 

' Q,\,n 

«2,1 

02,2 ' 

• a 2 ,n 

1 

On,2 ' 

' &n,n 


Consider the matrix above. We compute the determinant using the following process: 

• For some 1 < i < n, consider the element a, \ in the first column of M. 

• The (n - l)-dimensional matrix A7, is obtained by removing everything in the same row or column as a, \. 

• Compute the value S, = a,j det (M,). 

• Then, we have det(M) = Si - S 2 + S 3 - S4 + ... + (-1 )" +1 S„. 

This recursion results in the determinant equating to a sum of n ! terms, each of which is a product of n elements 
of M. After expanding this somewhat complicated recursive definition, we reach a more elegant formulation. 


■ det(M) = Yi (- 1 ) P< - ,r> (ai.o-(i) 02,o-(2) ■■■ o n ,o-(n)), where the sum is taken over all permutations cr of {1, 2, 3, ..., n\. We 

sym 

define P(cr) to be even if cr is an even permutation, and odd otherwise. [Leibniz formula for determinants] 


2. Express det 


( x y z } 
z x y 
y z x, 


as a polynomial in x, y, z. 


You may have noticed that for 3x3 determinants, the even permutations correspond to the three NW-SE 
‘diagonals’ and the odd permutations correspond to the three NE-SW ‘diagonals’. The diagonals are considered to 
wrap around the edges of the matrix as though it were a cylinder. This trick is known as the Rule ofSarrus. 


a b c a b c 
d e f d e f 
8 h i g h i 


aei + bfg + cdh -afh-bdi-ceg 

Leibniz’s formula requires «(«!) elementary operations, so is rather time-consuming for large matrices, taking 
exponential time. Instead, it helps to simplify the calculation by performing operations on the matrix. 
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■ Applying elementary operations to the rows or columns of M cause its determinant to behave in a predictable manner: 

■ Multiplying any row or column of M by x causes the determinant of M to be multiplied by x; 

m Adding (or subtracting) any multiple of one row to another row does not affect the determinant of M ; 

■ Swapping any two rows causes det(M) to be multiplied by - 1 ; 

■ The transpose of M has the same determinant as M. 

This can also be used to easily factorise the determinants of matrices. 



1 

1 

1 

3 . Factorise det 

X 

y 

z 


v x 3 

y 3 

Z 3 , 


into four linear factors. 


Interpolating curves 


The determinant of a matrix is zero if and only if one row can be expressed as a linear combination of the others. 
This is known as linear dependence. This enables one to create a curve of some type ( e.g. a polynomial, circle or 
conic) interpolating between various points. For example, if we have a sequence of n points (jc,-, y,), then the 
following curve is a degree-(« - 1) polynomial passing through all n points. 


The curve det 


polynomial] 


2 3 

y x x x 

yi xS xi 3 

y n x n x n x n 


x "- 1 

n -1 


Xi 


= 0 passes through all points (x,-, y,). [Lagrange interpolating 


This is obvious, as the determinant equals zero if two rows are identical. It is also a degree-(« - 1) polynomial, as 
we can use the recursive determinant formula to express it as + Ai y + A 3 x + A4 x 2 + Ag x 3 + ... + A n+ \ x n ~ l 
and rearrange it. If A 2 = 0 then this method will fail, but that only occurs if two points have the same abscissa. 

Using this idea, we can create a unique conic passing through any 5 points in general position, a cubic passing 
through 9 points et cetera. If the points are not in general position, then seemingly paradoxical things can occur. 
This forms the basis of the powerful Cayley-Bacharach theorem explored in the projective geometry chapter. The 
general equation of a conic is A + B x + C y + D x 2 + E y 2 + F x y = 0, so we can determine the equation of the 
conic passing through five given points. 



' 1 

X 

y 

X 2 

y 1 

xy 


1 

Xi 

yi 

Xi 2 

y 2 

xi yi 

The conic 

1 

Xi 

yi 

Xi 

yi 2 

xi yi 

1 

x 3 

T 3 

X 2 

y 2 

x 3 y 3 


1 

X4 

T 4 

V . 2 

T 4 2 

x 4 y 4 


„ 1 

x 5 

T 5 

V 2 

-^5 

ts 2 

X5 ys 


= 0 passes through all points (x,-, y,). [Interpolating conic] 


Circles also have a simple characterisation in Cartesian coordinates. 


4. Find the equation of the circle passing through the non-collinear points (jci, y 1), (xi, >’ 2 ) and (*3, >'3). 

[Circumcircle equation] 

The determinant formula is not constrained to Cartesian coordinates; it can be used to find interpolating curves in 
any coordinate system, such as projective homogeneous coordinates, areal coordinates, complex numbers and 
even polar coordinates. As we cover the other coordinate systems in greater depth later in the book, it is worth 
messing around with polar coordinates here. 
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■ The point with polar coordinates P = {r , 9) in the Euclidean plane is defined such that 0 P has length r and makes an 
angle of 9 with the positive x-axis. In Cartesian coordinates, P = (rcos 6, rsin 6). [Definition of polar coordinates] 

Although the value of r is uniquely defined, 6 is not; adding or subtracting multiples of 2 n will describe the same 
point. This is a consequence of the periodicity of the elementary trigonometric functions. 

5. Let Q = {r\, 6 1 ) be a point on the polar plane. Show that the equation of the circle with centre Q and radius 
a is given by r 2 + r \ 2 - 2 r r\ cos (6 - 6 \) = a 2 . [Polar equation of a circle] 

6. Hence show that a circle has general equation Ar 2 + B r cos 0 + C r sin 0 + D = 0. [General polar equation 
of a circle] 

It now becomes more obvious why this should work: the general equation for a circle in Cartesian coordinates is 
A(x 2 + y 2 ) + B x + C y + D = 0, and we have x 2 + y 2 = r 2 , x = r cos 8 and y = r sin 8 . 

7. Find the equation, in polar coordinates, of the circle passing through the non-collinear points (f \, 0\), 

</' 2 , 82 ) and (r^, 82 ). [Circumcircle equation for polar coordinates] 

If three of the points are collinear, the term in r 2 vanishes and we are left with the equation of a line. 

A curve which is particularly amenable to expressing in polar coordinates is the Archimedean spiral. If the spiral 
is centred on the origin, then it has polar equation r = r~( 6 ~ <p). h is the separation between successive turns of 
the spiral, and 0 is the angle at which is emerges from the origin. 
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8 . Find the equation for an Archimedean spiral of centre O passing through {r\, 8 \) and (j 2 , 62 )■ 

[Interpolating spiral] 

Adding multiples of 2 n to either of the angles can alter the number of turns on the spiral and its direction. There 
is not a unique interpolating spiral with centre O passing through two given points; there are countably infinitely 
many. 


Geometric transformations 


So far, we have considered linear transformations. If we compose an arbitrary linear transformation with an 
arbitrary translation, then we obtain an affine transformation. Affine transformations have all the geometric 
properties of linear transformations, but do not necessarily preserve the origin. They are a special case of projec¬ 
tive transformations, which are covered in a later chapter. 



Affine transformations are projective transformations which preserve the line at infinity. Linear transformations 
also preserve the origin, whereas similarities preserve (or reverse) the circular points at infinity (thus mapping 
circles to circles). Congruences are similarities with a determinant of ± 1, whereas homotheties are similarities 
which preserve the direction of all lines (thus all points on the line at infinity). Translations (and reflections in a 
point) lie in the intersection of congruences and homotheties. 

Do not worry if these terms are unfamiliar to you; they are explained properly in later chapters. 


Scalar product 


' a l ' 


'b 1 ' 


'Cl ' 

a 2 

, b_ = 

b 2 

and c = 

C2 

,03 . 


>3 , 


< C 3 ) 


be three vectors in R 3 . 
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■ The dot product (or inner product, or scalar product) a■ b = a\ b\ + a 2 b 2 + «3 b 2 = | a | \b_\ cos 6, where 9 is the 
angle between the vectors a and b. [Definition of dot product] 

The dot product is commutative and distributive, so a-b = b-a and a-(b + cj = a-b + a-(_. 

9. Prove that, for every triangle ABC, we have a 2 = b 2 + c 2 - 2b c cos A. [Law of cosines] 

The dot product generalises to vectors in IR". This allows us to interchange between trigonometric, geometric and 
algebraic inequalities. 

■ The following three statements are all equivalent: 

■ cos 9 < 1, with equality if and only if 0 = n n for some integer n; 
m q_-b_< \ a\ \b_\, with equality if and only if the vectors have the same direction; 

maibi + a 2 b 2 + ... + a„b n < xj ai 2 + a 2 2 + ... + a n 2 sj £>i 2 + b 2 2 + ... + b n 2 , with equality if and only if a t = A £>,- 
for some scalar A e [0, oo], [Cauchy-Schwarz inequality] 

We can generalise the idea of a vector to a more abstract object, and thus extend the Cauchy-Schwarz inequality 
even further. See Introduction to Inequalities (Bradley) for an example of this. 

Another application of the dot product in inequalities is a proof of the rearrangement inequality. That states that 
if we have two non-negative sequences of equal length and multiply corresponding terms, the product is greatest 
when the sequences are sorted in the same order. 

■ Suppose that a\ > a 2 > ... > a„ > 0 and b\ > b 2 > ... > b„ > 0 are two decreasing sequences of non-negative integers. 

n n 

Then X a,- b, > X a, b, r(l) for any permutation cr. [Rearrangement inequality] 

i= 1 i=l 


Proof: 

We can prove this by considering the vectors a and b in the space IR". Observe that all n ! vectors in \ly r } (the set 
of vectors obtained by permuting the elements of b) are of equal length, so lie on a sphere with centre 0. The dot 

n 

product X a , b^ of the vectors a and b^ is greatest when the angle between them is smallest, which occurs when 

i= 1 

a and ly r are closest (as all vectors in \ly r ] are of equal length). So, this has been converted into the equivalent 
problem of proving that b is the closest vector to a in {b^ }. We consider the Voronoi diagram of IR", which is 
simply a division of space depending on which ly r is closest. 



The diagrams above illustrate the cases when n = 2 or n = 3. The Voronoi diagram is created by the set of planes 
of the form x t = x p which each partition space into the regions x, > xj and x, < x - r This means that the regions of 
the Voronoi diagram are determined by the ordering of the elements; in the case where n = 3, we have six tetrahe¬ 
dral regions, namely xi > x 2 > x 2 and the five other permutations. As the elements of a and b are ordered in the 
same way, they must inhabit the same region. Hence, b is the closest vector in \ly r ) to a, and we are finished. 
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Vector and triple products 


So far, we are able to ‘multiply’ two vectors in IR", resulting in a scalar. We can also define a vector (cross) 
product, which is specific to R 3 . (There is also a 7-dimensional version based on the octonion algebra, but that is 
outside the scope of the book.) 


■ The cross product (or vector product, or exterior product) axj = 


' 02 bi - 03 b 2 ' 


' i_ a\ b\ ' 


03 b\ - Q\ b3 

= det 

j a 2 b 2 

, where i, j, k are 

, fll &2 - 02 b 1 


, k_ 03 &3 , 




1 


f° 


f 0 

the unit vectors 

0 

, 

1 

and 

0 


,0, 


,0 . 


, 1 , 


respectively. [Definition of cross product] 


The cross product is anti-commutative and distributive, so axfo = - Jxa and ax(fo + c) = axb + axe. The vector 
axb is perpendicular to both a and b_, and its magnitude is equal to the area of the parallelogram with vertices 
{0, a, b, a + b J. 

Finally, we define the scalar triple product, which is the volume of the parallelepiped with vertices 
{0, a, b, c, a + b, b + c, c + a, a + b + cj. 


a-(bxc) = (ax b)-c = det 


0\ b\ C! ' 
02 &2 C2 

l«3 h c 3 


. [Scalar triple product] 



Sir William Rowan Hamilton once had an epiphany whilst crossing a bridge, and carved the formula 
i 2 = j 2 = k 2 =ijk = -1 into one of the stones. This defines an extension to the complex numbers, which has four 
orthogonal units (1, i, j, k) as opposed to two. A Hamiltonian quaternion is a number of the form 

( x\ 


w + xc + yj + zk, where w, x, y, z e R. Using a slight abuse of notation, this can be written as w + 


y 


. A scalar 


V z 


added to a vector?! We can multiply two quaternions p = a + b and q = c + d_ together to give the quaternion 
p q = (a c - b_-d) + (a d_ + c b_+ bxd). Multiplication of quaternions is associative and distributive, but not 
commutative; p q + q p in general. This is inherited from the non-commutativity of the cross product. 


The quaternion w + xi + y j + zk has a norm of -y w 1 + x 2 + y 2 + z 2 ■ As with complex numbers, 
| p | | q | = [ p q | for any p, q e IH, where H is the set of all quaternions. 
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Solutions 



(a ' 


' b ' 


' c ' 

1. Using matrix multiplication, we get A = 

d 

,8. 

,B = 

e 

A 

and C = 

/ 

, i . 


x y z 


2. det 


Z x y 
y z x 
each x y z. 


= x 3 + y 3 + z 3 - 3 x y z, as the NW-SE diagonals are x 3 , y 3 , z 3 and the NE-SW diagonals are 


1 


3. We deduct the first column from the other two, obtaining det 


3 3 3 3 3 

x 3 y 3 - z - x 3 


. Applying the recursion 


formula reduces this to det 


y-x 


z - x 


3 3 3 

y 3 - x 3 z 


. We then divide the first column by y - x and multiply the 


entire determinant by y - x, obtaining (y - x) det 


1 


z - x 


9 9 9 9 

y z + x z + x y z- x 3 


. Applying a similar factorisation 


( 1 If 

to the second column results in (y - x) (z - x) det . Leibniz’s formula can now 

\x 2 + y- + xy x 2 + z 1 + xz ) 

be used to expand the determinant, giving (y - x) (z - x) (x y - xz+ y 2 - z 2 ). The quadratic factorises to 
(y - z) (x + y + z), so the entire determinant is equal to (y — x) (z — x) (y — z) (x + y + z). 


4 . det 


1 x y x 2 + y 2 
1 xi yi xr + yi 2 
1 X 2 J2 X 2 2 + Xl 
1 x 3 y 3 x 3 2 + y 3 2 

A + Bx+Cy + D(x 2 + y 2 ) = 0. 


= 0 will suffice, as the general equation for a circle is 


5. Let P = (r , 6) be a point on the circle, so P Q = a. By using the cosine rule, we have 
a 2 = r 2 + r 2 -2 rr\ cos (8 - 6 1 ). 

6. Using the compound angle formula, we get r 2 - 2 r r\ cos 9\ cos 9 - 2 r r\ sin 8\ sin 6 + r\ 2 - a 2 = 0. By 
altering 9\ and r \, we can change the coefficients of r sin 9 and r cos 8 to anything. Similarly, altering a 
enables us to change the constant term. Multiplying out by a constant scaling factor enables the coefficient 
of r 2 to be changed. Hence, the general equation is simply Ar 2 + Br cos 8 + C r sin 9 + D = 0. 


'1 

r 2 

rsin0 

r cos 8 

1 

n 2 

r\ sin 8 

r\ cos 9 

1 

r 2 2 

r 2 sin 8 

r 2 cos 8 

,1 

r 3 2 

r 3 sin 8 

r 3 cos 8, 


(1 


8. The general spiral has equation A + B r + C 9 = 0, so an interpolating spiral is det 


9 


l n 9i 

1 r 2 @2 


= 0 . 


9. Consider the triangle O A B. (| a - b_ | 2 ) = (a - b) = (| a | 2 ) + (| b | 2 ) - 2 a-b. The last term equates to 

-2 | a | | b_ | cos 9. This is the cosine rule, as required. 
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Combinatorics II 

This chapter discusses Ramsey theory, graph theory and topology. The principal principle of Ramsey theory is 
that ‘sufficiently large objects contain arbitrarily large homogeneous objects’. For example, Ramsey’s theorem in 
graph theory states that one can find arbitrarily large monochromatic cliques in a sufficiently large complete graph 
coloured with c colours. 


Gallai-Witt theorem 


■ Suppose we have a d-dimensional hypercube divided into g d elements, each of which is coloured with one of c 

colours. If g > G(d , c), where G is a function of d and c, then there exists some monochromatic (irregular) d-simplex 
homothetic to {(0, 0, 0, ..., 0), (1, 0, 0, ..., 0), (0, 1, 0, ..., 0), (0, 0, 1, ..., 0), ..., (0, 0, 0, ..., 1)}. [Lemma 1] 


Proof: 

To prove this, we induct on the number of dimensions. For d = 1, this is trivially true by the pigeonhole principle: 
G(l, c) = c + 1, as any set of c + 1 elements must contain two of the same colour. We can use this as a starting 
point for proving the case for d = 2. Firstly, we can guarantee the existence of things like this, known as (1, c, 1)- 
objects: 


We assume that the top vertex is a different colour to either of the bottom vertices of the triangle, since otherwise 
we are done. We consider a strip of squares, each of size G(l, c, 1) = G(l, c). They must each contain at least one 
(1, c, l)-object, like so: 


Moreover, each square can only have c G(1 ’ e,1)2 possible states. Consider a row of G(l, c G(1 ’ e,1)2 ) such squares. At 
least two of them must be identical, so we can guarantee the existence of things like this, known as (1, c, 2)- 
objects: 


All three rows of points must necessarily have different colours. We define G(l, c, 2) = G(l, c G(I ’ c,I)2 ) G(l, c, 1) 
to be an upper bound on the size of a box containing such an object. We then repeat the argument, considering a 
row of G(l, c G( 1 ,c,2)2 ) boxes of size G(l, c, 2): 


This gives us G(l, c, /+ 1) = G(l, c G(1,e ’^’~) G(l, c, /). Now consider (1, c, c)-objects. They must have c + 1 
rows of points, two of which must be the same colour by the pigeonhole principle. This means that we have a 
monochromatic isosceles right-angled triangle: 


In other words, G(2, c) = G(l, c, c). Now that we have tackled the two-dimensional case, we can begin work on 
three dimensions. We use the two-dimensional result G(2, c, 1) = G(2, c) as a base case, and perform an identical 
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inductive argument. Firstly, we can guarantee the existence of (2, c, l)-objects within a box of side length 
G(2, c, 1). 


4 


4 


Now, consider a plane of G(2, c G(2 ’ e, -f* 3 ) such boxes. They must necessarily contain an isosceles right-angled 
triangle of identical boxes, by the theorem for G(2, c), so we can guarantee a (2, c, 2)-object. 



Proceeding in this manner, we can guarantee the existence of a (2, c, c)-object, therefore a monochromatic 
tetrahedron. 


By repeating this argument and inducting on the number of dimensions, we can find upper bounds for G(d, c) for 
all integers d and c. This concludes the proof of Lemma 1. 


■ Suppose we colour the elements of Z" with c colours. Then, given a set of d + 1 vectors (0, ai, a 2 , ..., a rf ) c Z", we 
can find an integer AeN and vector v e Z" such that the points (v, v + A a\, v + Xa 2 , . .., v + A a rf ) are 
monochromatic. [Gallai-Witt theorem] 

This theorem also holds if Z" is replaced with Q" or IR". To prove the Gallai-Witt theorem, we ‘project’ Lemma 1 
from d dimensions onto a «-dimensional subplane by using a degenerate affine transformation. 



For example, the existence of the monochromatic triangle in the diagram above proves the existence of a set of 
reals homothetic to (0, a, b) on the line below. This argument generalises very easily to prove the Gallai-Witt 
theorem. 


1. Suppose c and n are integers. Prove that there exists an integer w = W(c, n) such that any c-colouring of the 
integers {1,2, ..., w{ contains a monochromatic arithmetic progression of length n. [Van der Waerden’s 
theorem] 
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By the pigeonhole principle, at least one of these c subsets has a ‘density’ 6 > -. A generalised version of van der 

Waerden’s theorem states that if s > S(6, n), then any subset A c {1, 2, ..sj with | A | > 6 s must contain an 
arithmetic progression of length n. This is known as Szemeredi’s theorem. Allowing 5 to approach infinity, we can 
apply this to the set of positive integers and obtain the following theorem: 

■ If some subset of the positive integers has a non-zero asymptotic density, then it contains arbitrarily long arithmetic 
progressions. 

Some other subsets of the positive integers contain arbitrarily long arithmetic progressions. Ben Green and 
Terence Tao proved that the prime numbers exhibit this property, despite having zero asymptotic density due to 
the prime number theorem. An unproven conjecture by Paul Erdos is that any set {a\, cm, a-}, ...} c N such that 
— + — + — + ... diverges to infinity contains arbitrarily long arithmetic progressions. Of course, Szemeredi’s 

a \ a 2 a 3 

theorem and the Green-Tao theorem are both special cases of this conjecture, since the prime harmonic series 
^ ^ ^ + ... diverges (albeit very slowly). 

Our argument gives very weak upper bounds for the van der Waerden numbers (minimal values of W(c, n)). By 
considering Szemeredi’s theorem, Tim Gowers currently has the tightest known upper bound, which is 

yt+9 

rye 2 

W(c, n) < 2 . The lower bounds are merely exponential, so very little is known about the asymptotics of van 

der Waerden numbers. 


Hales-Jewett theorem 


Observe that in no part of our proof of Lemma 1 did we use the entire Z". For the two-dimensional case, we only 
used a set of points corresponding to approximations to a fractal known as the Sierpinski triangle. The Sierpinski 
triangle is generated by beginning with a single point, then repeatedly placing three copies of it at the vertices of 
an equilateral triangle and scaling by -. Repeating this process four times, we obtain the order-4 approximation to 

the Sierpinski triangle, with 3 4 = 81 points (shown below). The Sierpinski triangle is the limit, when this process 
is repeated infinitely. 


We can associate points in the order-3 Sierpinski triangle with words of length 3 from the alphabet {A, B, Cl, like 


so: 
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ccc 

CCB CCA 

CBC CAC 

CBB CBA CAB CAA 

BCC ACC 

BCB BCA ACB ACA 

BBC BAC ABC AAC 

BBB BBA BAB BAA ABB ABA AAB AAA 

Suppose we introduce an additional symbol, *, which is considered to be a ‘variable’. A word containing at least 
one asterisk is known as a root. The root ABA * * C corresponds to the three words ABAAAC, AB ABB C 
and AB AC C C, where * successively takes on each of the three possible values. This set of three words is 
known as a combinatorial line. Note that combinatorial lines correspond to (upright) equilateral triangles of 
points in the Sierpinski triangle. 



More generally, we can associate points in the order-/? Sierpinski (;? - 1/-simplex with words from l!', where 2 is 
an alphabet of n symbols. 

■ Let I be an alphabet of n symbols. We colour each word of Z h with one of c colours. If h > H(n, c), then there exists a 
monochromatic combinatorial line. [Hales-Jewett theorem] 

So, it is equivalent to the following alternative formulation. 

■ Suppose we colour the vertices of an order-/? Sierpinski (n - l)-simplex with c colours. If h > H(n, c), there exists a 
monochromatic (upright) equilateral (n - l)-simplex. [Hales-Jewett theorem] 


Proof: 

The proof of Lemma 1 requires some slight refinement before it can be applied to prove Hales-Jewett. If we 
choose two generic points on the base of the Sierpinski triangle, then we cannot guarantee that there is a third 
vertex capable of completing the equilateral triangle. For example, BAB and ABA do not belong to a combinato¬ 
rial line. So, we cannot merely apply the pigeonhole principle to the 2 h points on the base of the Sierpinski 
triangle. Instead, however, we can apply it to the h + 1 points corresponding to ‘powers of two’. 
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If we select any two of the h + 1 circled vertices, there is a third point capable of completing the equilateral 
triangle. Applying the Pigeonhole principle, we can let h-c and there must be two circled vertices of the same 
colour. This proves the existence of (1, c, l)-objects, i.e. equilateral triangles with two vertices of the same colour. 


The remainder of the argument is identical to that of Lemma 1, so there is no need to repeat it here. 

The Hales-Jewett theorem has a generalisation. If we allow roots with two variable symbols, such as * and ©, 
then we can create a combinatorial 2-plane by considering the set of n 2 words formed by replacing each variable 
with each of the letters in 2. For example, if 2 = {A, B }, the root A * B B O A * would correspond to the combina¬ 
torial 2-plane {AABBAAA, ABBBAAB, AABBBAA, ABBBBAB). More generally, if we have roots 
with p variable symbols, then we generate a combinatorial p-plane of n p words. 

2. Let 2 be a finite alphabet of n symbols, and colour the words of 2 J with c colours. Prove that if 

j > J(n, c, p), then there exists a monochromatic combinatorial p-plane. [Generalised Hales-Jewett] 

Here is an example of a combinatorial 2-plane on the order-4 Sierpinski triangle: 

© 


© 0 


© 


0 


© © 


© © 


Just as van der Waerden’s theorem has a stronger ‘density’ version (namely Szemeredi’s theorem), so does the 
Hales-Jewett theorem and its generalisation. Using rather advanced methods, Furstenberg and Katznelson proved 
the theorem in 1991. More recently, a large collaborative effort (the Polymath project) led by Gowers resulted in 
an elementary combinatorial proof of the theorem, and thus Szemeredi’s theorem and its multidimensional 
extension (the density version of Gallai-Witt). 
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Noughts and crosses 


Effectively, the ordinary Hales-Jewett theorem states that in a c-player, /z-dimensional game of tic-tac-toe on a 
board of size n, where h > H(n, c), the game cannot terminate in a draw. Hence, one player has a winning strategy. 
The second player cannot have a winning strategy, as the first player can play randomly on the first move and 
emulate the winning strategy of the second player, knowing that owning an extra square cannot possibly be 
detrimental. This means that the first player can always win if the dimension is sufficiently large. 



For ordinary ‘noughts and crosses’ where c = 2, /; = 3 and h = 2, it is well known that neither player has a winning 
strategy. A typical drawing pattern is displayed above. It has been proved that H( 3, 2) = 4, so it is impossible to 
draw in a four-dimensional game of tic-tac-toe. This does not necessarily mean that three-dimensional games 
terminate in a draw; certain diagonal lines are not considered to be combinatorial lines. 

3. Does the first player have a winning strategy for c = 2, n = 3 and h = 3? [Three-dimensional noughts and 
crosses] 

4. A game is played between two players on a 1 by 2010 grid. Taking it in turns, they place either an S or an O 
in an empty square. The game ends when three consecutive squares spell out SOS, at which point the 
player who has just played wins. If the grid fills up without this happening, the game is a draw. Prove that 
the second player has a winning strategy. [Advanced Mentoring Scheme, November 2010, Question 2] 


Ramsey’s theorem 


Colour each of the edges of the complete graph K n either red or blue. Let /ClV, h) be the smallest value of n such 
that there must be either a red K r or blue K h contained within the graph. For example, /C(3, 3) = 6, as all colour¬ 
ings of K(, contain a monochromatic triangle, whereas the following colouring of K 5 does not: 
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5. Prove that + 1, b + 1) < RjO' + I. /?) + ASO, b + 1). [Bicoloured Ramsey’s theorem] 

This argument generalises. If we colour the edges of K„ red, blue and green, where n > A 2 (/\ g, b), then there must 

be either a red K r , a green K g or a blue K h . 

A further generalisation is by considering hypergraphs instead of graphs. Edges can be considered to be 
unordered pairs of vertices; if, instead, we colour unordered sets of k vertices, we obtain a complete k-hyper- 
graph. It transpires that Ramsey’s theorem generalises to hypergraphs. 

■ Let (Ci, C 2 , ..., C c ) be a set of c colours. Colour each unordered k -tuple of {1, 2, 3, . .., n) with one of 

(Ci, Ci, ..., C c }. Then, if n > R k (a\, < 22 , ..., a c ), there is some 1 <i< c and some subset of a t vertices, all ^-tuples of 

which are coloured with C,. [Generalised Ramsey’s theorem] 


Proof: 

We induct on the value of k. For k = 1, this reduces to the pigeonhole principle: 

/?i(fli, a 2 , ..., a c ) = 1 + (ai - 1) + (<22 - 1) + + (d c - !)• Suppose we are trying to prove the existence of 

Rk(a 1 , 02 , ..., a c ). Let n = 1 + Rk-i(R k {a\ - 1, 02 , ..., a c ), RiSfli, «2 - 1, ■ ■■, <z c ), ..., R k (ai, 02 , ..., a c - 1)). Select 
an arbitrary vertex V. By Ramsey’s theorem for (k - l)-hypergraphs, we can guarantee that there must be, for 
some colour C,, a set of b- Rk(a\, 02 , ..., a , - 1, ..., a c ) vertices {W], W 2 , ..., Wf,\ such that every unordered k- 
tuple containing V and k - 1 elements of {Wi, W, ..., W/,j is coloured with C,-. Amongst those b vertices, there 
must either be a set of aj vertices, all ^-tuples of which are coloured with C, (in which case we are done), or a set 
of aj - 1 vertices, all ^-tuples of which are coloured with C,. Consider those a, - 1 vertices together with V. All k- 
tuples of those a, vertices are coloured with C,-, so the inductive step is complete. As the base case 
Rk( 0, a 2 , ..., a c ) is trivial, we are done. 

6. Prove that, for every n > 3, there exists an integer k = K(n) such that every set S of k points in the plane in 
general position must contain a convex n-gon formed from n points of S. [Happy ending problem] 

This problem is so named as it lead to the eventual marriage of George Szekeres and Esther Klein. Klein was 
responsible for discovering that K( 4) = 5, and the result was generalised by Erdos and Szekeres. 


Dilworth’s theorem 


The case of Ramsey’s theorem for two colours and ordinary graphs gives exponential bounds on the number of 
vertices. With the base case of R(2, n) = n, it is evident that R(n, m) is bounded above by the binomial coefficient 
<n+m> - if we make additional constraints on how the edges are allowed to be coloured, then we obtain a much 

n\ mi 

stronger (indeed, optimal) bound. 

■ Suppose we define a relation t on the elements of a set S, such that, for all a, b, c e 5: 

■ a fc a; [Reflexivity] 

■ If a i b and b > a then a = b; [Antisymmetry] 

■ If a i b and bh c then a > c\ [Transitivity] 

Then, is known as a partial order on the elements of S. [Definition of partial order] 
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Common examples of partial orders include the relation a \ b on the set of positive integers and the relation a < b 
on the set of real numbers. 

■ If neither a fc b nor b a, then a and b are said to be incomparable. A set {ci, C 2 , c„) c S such that 

cj fc C 2 fc .. • c„ is known as a chain of length n. A set («i, a 2 , ..., a m ) c S such that a t and aj are incomparable for 
all i + j is known as an antichain of length m. [Definition of chains and antichains] 

We can interpret the elements of 5 as the vertices of a complete graph, joined with a blue edge if the elements are 
incomparable and a red edge otherwise. Then, Ramsey’s theorem guarantees that if | S | > R(n, m) < , there 

must be either a chain of length n or an antichain of length m. However, it is possible to prove much tighter 
bounds than those applicable to general graphs. 

7. Prove that if | S | > (n - 1) (m - 1) + 1, then there is either a chain of length n or antichain of length m. 

[Dilworth’s theorem] 

In general, (« - 1) (m - 1) + 1 is much smaller than . Note that Dilworth’s theorem cannot be improved 

n\ mi 

upon, as it is easy to define sets of (n - 1) (m - 1) elements where the longest chain is length n — 1 and the longest 
antichain is length m — 1. For example, here is a partially ordered set of 20 elements where there are no chains of 
length 6 or antichains of length 5. 



8. Show that a sequence of length n m - n - m has either a monotonically increasing subsequence of length n 
or a monotonically decreasing subsequence of length m. [Erdos-Szekeres theorem] 


Turan’s theorem 


Suppose we have a graph G of n vertices. If there exists a set of k vertices K = {Ai, A 2 , ..., A k ) c G such that 
every pair of vertices A t Aj is connected by an edge, then K is described as a k-clic/ue. For example, the following 
graph contains a 5-clique: 




What is the maximum number of edges G can have such that there are no (r + l)-cliques? For r = 1, there can be 
no edges, since an edge is a 2-clique. For r = n- 1, there can be | ? j - 1 edges, since we can delete a single edge 
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from the complete graph K„. For other r, it turns out that the maximum number of edges is uniquely achieved by 
the Turdn graph T(n, r), which is constructed by partitioning the n vertices into r subsets of almost equal size 
(differing by at most 1) and joining two vertices if and only if they inhabit different subsets. For instance, the 
tetrahedron-free graph on 10 vertices with the most edges is shown below: 



9. 21 apples are placed on the unit circle. Show that there are at least 100 line segments of length < V3~ with 
Rosaceae endpoints. [Ross Atkins, Trinity 2012] 

Again, this bound is attained by the configuration equivalent to the Turan graph, by separating the 21 apples into 
two groups of roughly equal size, situated near diametrically opposite points on the unit circle. 



10. Given nine points in space, no four of which are coplanar, find the minimal natural number n such that for 
any colouring with red or blue of n edges drawn between these nine points there always exists a 
monochromatic triangle. [IMO 1992, Question 3] 


Planar graphs 


We describe a graph as planar if it can be drawn in the plane without any edges crossing. For example, the 
complete graph K 4 is planar, whereas Ks and the complete bipartite graph W 33 are not. 
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Indeed, a graph is planar if and only if it contains either K$ or A 3 3 as a minor, i.e. can be reduced to one of these 
graphs by a combination of deleting edges, deleting vertices and contracting edges. For example, the (non-planar) 
Petersen graph below can be reduced to A 5 by contracting the five ‘shortest’ edges. 



A planar graph divides the plane into well-defined regions (or faces). The following graph has five faces, four of 
which are bounded. We have used four colours such that neighbouring faces are different colours; in general, this 
is possible with any planar graph. 



Indeed, it is best to append a point at infinity (this will become familiar to you later when we explore inversion) to 
convert the plane into a topological sphere. In this case, our planar graph is equivalent to the triangular prism. 
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There is a useful invariant applying to graphs drawn on some surface. Suppose we have the following two 
elementary operations: 

• A: inserting a vertex somewhere on an edge; 

• B: drawing an edge between two vertices, ensuring the graph remains planar. 

We are also allowed their inverse operations, A -1 and B l . The value of x = V + F - E (where V , F, E are the 
numbers of vertices, faces and edges, respectively) is referred to as the Euler characteristic , and is unaffected by 
these elementary operations. As we can obtain all planar graphs (or, equivalently, polyhedra with no holes) from 
these operations, then the Euler characteristic is constant. It is a trivial exercise to verify for a simple polyhedron 
(such as the tetrahedron, with (V, F, E) = (4, 4, 6)), that the Euler characteristic must be 2. 

Assuming no ‘funny business’ such as faces containing holes, the Euler characteristic is constant for all graphs 
drawn on a particular surface. Equivalently, it is constant for all polyhedra with a certain number of holes. For 
example, the Euler characteristic of a torus is 0. Every new hole decreases the Euler characteristic by 2, so by 
induction we have \ = 2 - 2 H, where H is the number of holes. More remarkably, an unbounded surface can be 
identified simply by its Euler characteristic and orientability (whether indirect isometries exist). For instance, the 
torus and Klein bottle are the orientable and unorientable surfaces, respectively, with an Euler characteristic of 
zero. 



11. The Klein quartic is a surface topologically equivalent to a multi-holed torus, which can be tiled by 24 
heptagons, where three heptagons meet at each vertex. What is its Euler characteristic, and thus how many 
holes does it have? 

This description of the Klein quartic may remind you of the {7, 3) tiling of hyperbolic space. Indeed, it is obtained 
by ‘rolling up’ a finite patch of the hyperbolic tiling into a surface in the same way that a chessboard (finite patch 
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of the square tiling {4, 4}) can be rolled up into an ordinary torus as in the diagram. Similarly, the Platonic solids 
are obtained by rolling up a finite patch (namely all) of a spherical tiling into a sphere. The Klein quartic has a 
symmetry group of order 336, far exceeding that of the most symmetrical Platonic solids (the icosahedron and 
dodecahedron, with 120 symmetries). 
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Solutions 

1. This is a trivial corollary of Gallai-Witt in one dimension. 

2. As the base case, J(n, c, 1) = H(n, c). For the inductive step, we let j = j(n, c nJM \ l). This means that 
there must be a monochromatic line of identical objects, each of which must contain a monochromatic k- 
plane. In total, this gives us a monochromatic (k + l)-plane. Note that this is a ridiculously fast-growing 
function. 

3. Yes. Place a ‘nought’ in the central cube. Assume the opponent plays a ‘cross’ in a cube C. Choose a cube A 
which is not collinear with C, and place a ‘nought’ in the cube diametrically opposite to A. This forces your 
opponent to place a ‘cross’ in A. As the two ‘crosses’ are non-collinear, you now have a free move. Place a 
‘nought’ in a position coplanar with your existing two ‘noughts’. This creates two partial lines; your 
opponent can only block one of them. 

4. Define an ‘unsafe move’ to be one that results in the opponent winning on the subsequent move. The only 

unsafe move for placing an O is S _-> S O Define an ‘unsafe square’ to be one where placing either an 

S or an O is an unsafe move. The only unsafe squares are of the form S _ S , which occur in pairs due to the 

bilateral symmetry. Hence, there is always an even number of unsafe squares. So, if there is a nonzero 
number of unsafe squares, the second player has a winning strategy as her opponent must eventually place a 
letter in an unsafe square, resulting in a win for the second player. To force a win, therefore, she must 

simply create an arrangement of the form S _ S. Immediately after the first player moves, the second player 

places a S sufficiently far from the first move. If the first player tries to block by playing close to the S, 

simply place another S on the opposite side, resulting in S _ S. As soon as the first player makes an unsafe 

move (which he inevitably will), the second player can immediately win. 

5. Consider a vertex V from the graph. It must have either at least R(r, b + 1) red edges or R(r + 1, b) blue 
edges connected to it. Assume, without loss of generality, that the former is true. Consider the set S of 

R(r, b + 1) vertices connected to V via red edges. The subgraph on the vertices of S must then either contain 
a blue Kb + i (in which case we are done) or a red K r . V is connected to each of the vertices of the K r by a red 
edge, resulting in a red K r+l . 

6 . It is straightforward to show that K( 4) = 5, i.e. that every set of five points contains a convex quadrilateral, 
by considering all possible diagrams. This acts as a ‘base case’ to apply Ramsey’s theorem for K(n). Colour 
each 4-tuple of points blue if they are convex, or red if they are non-convex. By Ramsey’s theorem for 
R 4 (5, n), there must be either a set of n points that form a convex polygon or a set of 5 points, no 4 of which 
form a convex polygon. Due to the base case, the latter is impossible, so the former must invariably be true. 

7. Let / : 5 -> Z + be a function mapping each element of S to the length of the longest chain terminating in S. 
If there are no //-chains. then the values may only range from 1 to w — 1. Similarly, if there are no / 77 - 
antichains, then only m - 1 elements are allowed to take each value. So, there are at most (« - 1) (m - 1) 
elements in S. 

8 . Consider the sequence {ai, « 2 , ■ • For each a, and aj, we say that a, > aj if both i > j and a, > cij. 

Then, Dilworth’s theorem guarantees that either a chain (increasing subsequence) or antichain (decreasing 
subsequence) exists. 

9. Let G be the graph on 21 vertices, where two vertices share an edge if and only if they are separated by a 
distance greater than y/~3~. As any three apples must form the vertices of a triangle, and one angle must be at 
most ^, we can use the sine rule to deduce that one of the edges must be less than or equal to V3~- Hence, 

(21 j 

G is triangle-free and has at most 110 edges by Turan’s theorem. As we have a total of I I = 210 pairs of 
apples, there are at least 210 - 110 = 100 line segments of length < V^T. 
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10. To prove that n > 32, consider the Turan graph above on nine vertices containing no 6 -cliques. It has 32 
edges. We call the five subsets of vertices A t , A 2 , A 3 , A 4 , A 5 , and join vertices X e A, and Y e Ay with a red 
edge if (i - j) = + 1 (mod 5), a blue edge if (i - j) = +2 (mod 5), and no edge if i = j. To prove that n < 33, 
note that all graphs with 33 edges must contain a 6 -clique by Turan’s theorem. This 6 -clique must contain a 
monochromatic triangle by Ramsey’s theorem. 

11. Each vertex is adjacent to three heptagons, and each heptagon has seven vertices. Hence, there must be 

24 =56 vertices. Similarly, each heptagon has seven edges, and each edge is adjacent to two heptagons, 

so there are 84 edges. The Euler characteristic is 24 + 56 - 84 = -4, so the Klein quartic is topologically a 
three-holed torus. 
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Polynomials 

The set of polynomials {/(x) = a a + ai x + a 2 x 2 + ... + a n x"}, where n is a non-negative integer and 
{ao> a u ■ a n 1 c S, is denoted Six] (pronounced ‘S adjoin x’). In this chapter, we will explore the cases where S 
is the set of complex numbers or real numbers. 


Complex polynomials, C[x] 


Suppose we have a degree-/! polynomial f(x) = ciq + a\ x + x 2 + ... + a n x", where a o, .. a n are complex 
constants and a n is non-zero. According to the fundamental theorem of algebra, we can express it as a product of 
linear factors of the form x - a,, where a, is a (complex) root of the polynomial. 

■ If we have a monic polynomial f(x ) = ag + «i x + a 2 x 2 + ... + a n x", where a n = 1, then we can express 
f(x) = (x - ct\) ( x - a 2 ).. .(x - a n ), where a\, ..., a„ are (not necessarily distinct) roots of the polynomial. 

[Fundamental theorem of algebra] 

For example, the polynomial x 4 - 1 can be factorised as (x - 1) (x + 1) (x + i) (x - i), where i = V -1 is the 

Ini 

imaginary unit. In general, the polynomial x n - 1 = (x - 1) (x - r) (x - r 2 ).. .(x - r" _1 ), where r = e~ is a princi¬ 
pal nth root of unity. The roots of unity are positioned at the vertices of a regular n-gon with centre 0 and a vertex 
at 1. The example for x 5 - 1 is shown below. 


4 in 

e J 

2 in 

e 5 


i 

4l7T 


e 5 



2 i n 


e 5 


This means that the degree-/! curve y = f(x) meets the degree-1 line y = 0 in at most n points. There is nothing 
special about the line y = 0, and this also applies to any line. More remarkably, the polynomial curve can be 
replaced with any algebraic curve (such as the unit circle, x 2 + y 2 = 1, which has degree 2). Even more generally, 
where the line is replaced with another algebraic curve, we have Bezout’s theorem. 

■ Suppose P and Q are two curves of degrees m and n, respectively. If they intersect in finitely many points, then they 
intersect in at most m n points. [Bezout’s theorem] 

Equality occurs if we consider intersections on the complex projective plane (instead of the real plane), and count 
intersections with appropriate multiplicity ( e.g. twice for tangency, thrice for osculation et cetera ). The complex 
projective plane is discussed in later chapters, and it is only necessary at this point to use the weak form of 
Bezout’s theorem. 
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1. Show that two ellipses intersect in at most four points. 

2. Consider the regular n-gon with vertices A\, A2, ..., A„, where n > 5. Let P be a variable point on the 
circumcircle of the «-gon. Show that the value of f(P) = A\ F y4 + At P 4 + ... + A n P 4 remains constant. 


Difference of three cubes 


A particularly useful polynomial is x 3 + y 3 + z 3 - 3 x y z. Over the reals, it factorises into 

(x + y + z) [x 2 + y 2 + z 2 - x y - y z - z xj, and the quadratic term can be further factorised over the complex 

numbers. This polynomial recurs in many situations, including olympiad problems. 


The polynomial x 3 + y 3 + z 3 - 3 x y z = det 


(x y z 

Z x y 


= (x + y + z) (x + ca y + a) 2 z) (x + cl> 2 y + a/ z), where lj is a 


y z x 

principal cube-root of unity. [Difference of three cubes] 


This generalises to n variables, instead of merely three. Indeed, the name is derived from the n = 2 case, known as 
the ‘difference of two squares’, x 2 - y 2 - det| ^ j = (x + y) (x - y). 


Xo 

Xl 

x 2 ■■■ 

■*71-1 


Xfi—1 

xo 

X! ■■■ 

•*71-2 

n-1 

X„-2 

Xn-1 

xo ■■■ 

-*71-3 

= n 





r= 0 

Xl 

X2 

X 3 ••• 

*0 , 




The sums in the product on the right-hand side are the terms of the discrete Fourier transform of 
{xo, x\, ..., x„_i]. (Continuous) Fourier transforms were originally discovered to explain how the sound of an 
entire orchestra can be composed of basic sinusoidal waves. Today, this idea is used to analyse electrical circuits. 
The discrete Fourier transform can be computed quickly using certain algorithms, forming the basis of the fastest 
known algorithm for multiplying two large integers. 

3. Find the minimum value of x 2 + y 2 + z 2 , where x, y, z are real numbers such that x 3 + y 3 + z 3 -3xyz=l. 

[BM02 2008, Question 1] 


Cubic equations 


For sufficiently small degree, it is possible to solve polynomial equations using radicals (nth roots). For the 
general quadratic equation, the Babylonian technique of ‘completing the square’ suffices. Solving the cubic 
equation is a more difficult, multi-step process. If we can immediately find a root a, it is possible to divide by 
x - a to reduce the equation to a quadratic. Otherwise, more ingenious techniques are required. 

4. Suppose we have an equation ay 3 + by 2 + cy + d = 0. Show that this can be converted to an equation of 
the form x 3 + p x + q = 0, where x is a linear function of y. [Reduction to a monic trinomial] 

Hence, it is only necessary to consider the latter case, as all other cubics can be reduced to it. 

5. Show that x 3 + p x + q = 0 is equivalent to x 3 + a ' + if - 3 a b x = 0, where a 3 and b 3 are the roots of the 

, „ 3 

quadratic equation z — q z — — = 0 . 


6 . Hence show that x 3 + p x + q = 0 has a root 
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x = -(a + b) = 




[General solution to cubic 


equations] 


The general solution to the cubic equation was the quest of many mediaeval mathematicians. After a partial 
solution by Omar Khayyam, the first complete solution was by Niccolo Tartaglia. However, when the quartic was 
later solved by Lodovico Ferrari, Tartaglia’s solution of the cubic was mistakenly attributed to Gerolamo Car¬ 
dano, and is thus referred to as Cardano’s formula. This displeased Tartaglia to a great extent. 

Due to the existence of formulae for solving the general quadratic, cubic and quartic equations, people imagined 
that there might be similar algebraic methods for solving any polynomials using radicals. However, this is not the 
case. Galois theory demonstrates that it is impossible to solve the general quintic, and of course polynomial 
equations of higher degree. 



7. Two parallel planes cut the sphere of unit radius into three equal volumes. Find a cubic equation in rational 
coefficients, one root of which is the separation d between the two planes. 


Symmetric polynomials 


So far, we have mainly considered polynomials in one variable. The difference of three cubes was an exception, 
as it featured three variables. Indeed, it is what is known as a symmetric polynomial, as interchanging any two of 
the variables leaves the polynomial unchanged, x 2 - y 2 is not symmetric, since interchanging x and y negates the 
value, rather than preserving it. (x - y) 2 , however, is symmetric. 

8 . Suppose we have a symmetric polynomial in two variables, x and y. Show that it can be expressed as a 
polynomial in.? and p, where s = x + y and p = x y. 

This is a special case of Newton’s theorem of symmetric polynomials: 

■ Any n-variable symmetric polynomial in x\, X 2 , x n can be expressed as a polynomial in the elementary symmetric 
polynomials (ESPs), i.e. coefficients of (x - x\) (x - X 2 ) ...(x- x n ). [Newton’s theorem of symmetric polynomials] 


Proof: 

Note that any symmetric polynomial can be multiplied out to yield a sum of terms of the form 
k 2 (xf l xf 11 ... x„ an ), where the sigma indicates a symmetric sum. We will represent this as k f(a 1 , a 2 , ..., a n ). 

sym 

We assume without loss of generality that a\ > as > ... > a n , and lexicographically order the terms. (Specifically, 
f{a\, a 2 , ..., a n ) precedes f{b\, bs, b n ) if a\ < b\, or a\ = b\ and <72 < ^ 2 , or rq = b\ and a 2 = ^2 and <23 < b^, et 
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cetera.) We then proceed via an inductive argument. 

For a given term k f(a\, < 22 , ■ • a n ) of the ordering, assume that all preceding terms can indeed be expressed as 
polynomials in the ESPs. Suppose that a\=ai= ... = a/, > a h+\ • We subtract the symmetric polynomial 
k /(1, 1, 1, 0, 0) f(a\ - 1, «2 - 1, • a/i ~ 1, a/i+ 1 , < 2 / 1 + 2 , •••, d„). Since /(1, 1, 1, 0, 0) is already 

an elementary symmetric polynomial, and f(ci\ - 1 , 02 - 1 , ..., a fl - 1 , a^+i, a/,+ 2 , «„) is a symmetric polyno¬ 

mial of lower degree than the original expression, the term we have subtracted can be expressed as a polynomial 
in ESPs. The remainder exclusively contains terms that precede f(a\, < 22 , «„), thus can also be expressed as a 

polynomial in the ESPs. 

9. Given real numbers a, b, c, with a + b + c = 0, show that a 3 + b 3 + c 3 > 0 if and only if a 5 + b 5 + c 5 > 0. 

[BM02 2004, Question 3] 
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Solutions 


1. Ellipses can be represented by quadratic equations in x and y (like all conic sections). As a consequence of 
Bezout’s theorem, they can intersect in no more than 2x2 = 4 points. 

2. Consider an arbitrary point Q in general position on the circumcircle of the n-gon, and consider the curve 
f(P) = /(G)- It is a quartic curve (by definition) and must intersect the circumcircle in 2 n points (rotations 
and reflections of Q). Due to Bezout’s theorem, a quartic and circle sharing no common component can 
only intersect in at most 8 points; however, 2 n > 8 , so the quartic must contain the circle. Hence, all points 
on the circle satisfy f(P) = f(Q). 


3. x 3 + y 3 + z 3 - 3 x y z = det 


is the volume of the equilateral parallelepiped with vertex 


(x y z ' 
z x y 
y z x 

O = (0, 0, 0) and adjacent vertices A = (x, y, z), B = (y, z, x) and C = (z, x, y). Suppose we fix 

x 2 + y 1 + z 2 = r 2 , the square of the side length, instead of fixing the volume. If O, A and B are constrained to 

lie in a plane, then the volume is maximised when O C is a normal to this plane. Hence, the volume is 
maximised relative to the side length when the parallelepiped is a cube with volume r 3 . As such, the 
minimum value of r 2 for a parallelepiped with unit volume is 1 , when {x, y, z} = { 1 , 0 , Of 


4. Firstly, we divide by a to obtain x 3 + - x 2 + - x + - =0. Let y = x + —. Then, our cubic becomes 

' J a a a 3 a 


(y - j-f + - (y - j-)" + y - + ~ = 0. By using the binomial expansion of the first two terms, the 

coefficients in y 3 and y 2 are one and zero, respectively. 


b \2 


5. We let q = a 3 + b 3 and p = -3 a b. a 3 and b 3 are the roots of (z - a 3 ) (z - £> 3 ) = 0, which expands to 


z 2 - (a 3 + b 3 )z + a 3 b 3 = 0. We already have a 3 + b 3 =q, and a 3 b 3 = ( a b ) 3 = ( - ) 3 = -{ 


6 . x 3 + a 3 + b 3 - 3 a b x = (x + a + b) (x + u> a + to 1 b) (x + or a + to b) = 0 has roots x = -(a + b), 

x = -(<x> a + to 1 b) and x = -( to 2 a + to b ). Using the Babylonian formula for solving the quadratic equation 
gives us the values of a 3 and b 3 , whence we can obtain a and b by cube-rooting. 

d d 

7. The volume enclosed by the two planes is given by f 2 d it y 2 dx = f 2 d n( 1 - x 2 ) dx 

2 2 

1 ( d 3 \ 4 

- of the total volume of the sphere, we have n\d - — 1 = - n. This simplifies to d 
of which must be the separation between the planes. 

8 . This is a special case of Newton’s theorem of symmetric polynomials, which is proved later in the chapter. 


= n(d - ~ For this to be 
-12 d + — = 0 , one root 


9. We have a 3 + b 3 + c 3 = 3 a b c, by the difference of three cubes. Also, a 2 + b 2 + c 2 = -2 a b - 2 b c - 2 c a, 
since (a + b + c ) 2 - 0. We can express a 5 + b 5 + c 5 as (a + b + c) ( a 4 + b 4 + c 4 ) - £ a 4 b. The first term is 

sym 

zero, so a 5 + b 5 + c 5 = - 2 « 4 b = a 3 b c + b 3 c a + c 3 a b - (< 3 3 + b 3 + c 3 ) (a b + b c + c a). After factorisation, 

sym 

this is then equal to a b c(a 2 + b 2 + c 2 ) - 3 a b c(a b + b c + c a), or | ab c(a 2 + b 2 + c 2 ). Except in the trivial 

case where all variables are zero, a 2 + b 2 + c 2 > 0 , so a 3 + b 3 + c 3 and a 5 + b 5 + c 5 are positive real multiples 
of a b c, thus both have the same sign. 
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Sequences 

In this chapter, we are concerned with infinite sequences of either integers or, more generally, real numbers. 
Although it is no longer one of the main phyla of questions in the IMO (combinatorics, algebra, geometry and 
number theory), sequences do feature very prominently. 


Generating functions 


To manipulate sequences, it is useful to be able to represent them algebraically as a power series known as a 
generating function. 

■ A sequence {Ao, Ai, Ai, A 3 , ...) has the ordinary generating function a a (x) = Ao + A\ x + A 2 x 2 + A 3 x 3 + .... 

[Definition of OGF] 

We can add and multiply ordinary generating functions, which correspond to addition and convolution of their 
respective sequences. 

1. Suppose we have two sequences, {A„] and \B n ), with ordinary generating functions a 0 (x ) and b 0 (x), 
respectively. Let {C„| and \D n } have ordinary generating functions a„(x) + b 0 (x) and a 0 (x) b a (x), 
respectively. Show that C n = A n + B n and D n = Aq B n + A\ B n t + ... + A„ Bq. [Addition and convolution] 

2. Find a closed form for 1 + x + x 1 + ..., the ordinary generating function of {1, 1, 1, ...}. Hence find an 
ordinary generating function for the sequence of natural numbers, {1, 2, 3, ...}, and the triangular numbers, 

{1, 3, 6 , 10, ...). 

3. Suppose {Ao, Ai, A 2 , A 3 , ...} has ordinary generating function ajx). What sequence has ordinary 
generating function y a c (x)? 

4. Hence find the sequence with ordinary generating function ln( I - x). 

When differentiating Ao + Ai x + A 2 x 2 + A 3 x 3 + ..., we obtain the sequence A 1 + 2 A 2 x + 3 A 3 x 2 + 4 A 4 x 3 + .... 
As an operation on sequences, this is a left shift followed by a (somewhat annoying) multiplication of each term 
by a different scalar. We can remove this inelegance by defining a more complicated exponential generating 
function, or EGF. 

■ A sequence {Bo. B\, B 2 , B 3 , ...) has the exponential generating function b e (x) = y + yx + yx 2 + yx 3 + .... 

[Definition of EGF] 

If we differentiate it, we obtain the exponential generating function of the sequence { B \, Bo, B 3 , ...}, which is 
simply the original sequence shifted to the left. The sequence {1, 1, 1, ... ] is invariant when shifted to the left, so 
its exponential generating function (namely /(x) = e* = 1 + x + ^ x 2 + yx 3 + y x 4 + ...) is invariant under 
differentiation, and is thus a solution to the differential equation /(x) = / ' (x). 

5. Show that the exponential generating function of {1, 0, -1,0, 1,0, -1,0, ...] is a solution of the 
differential equation /" (x) + /(x) = 0 . 

This type of differential equation is encountered in the mechanics of mass-spring systems. This particular solution 
is the function /(x) = cos x; the general solution is /(x) = A cos x + B sin x, corresponding to the sequence 
{A, B, -A, -B, A, B, -A, —B, ...]. For this reason, a stretched spring with a suspended mass will oscillate 
periodically in simple harmonic motion. 
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Some basic sequences and their exponential generating functions are given below. 


Sequence 

EGF 

{ 1 , 1 , 1 , 1 , ...} 

e x 

{ 1 ,- 1 , 1 ,- 1 , 1 , ...} 

e~ x 

{ 1 , 0 , 1 , 0 , 1 , ...} 

cosh(x) 

{ 0 , 1 , 0 , 1 , 0 , ...} 

sinh(x) 

{ 1 , 0 ,- 1 , 0 , 1 , 0 , - 1 , 0 , ...} 

cos(x) 

{ 0 , 1 , 0 ,- 1 , 0 , 1 , 0 , - 1 , ...} 

sin(x) 

{1, 2, 4, 8 , 16, ...} 

e 2 * 

{1,2, 3, 4, 5, ...} 

e x x 


6 . Find a sequence {Fo, F\, Ft, ...! whose exponential generating function satisfies the differential equation 

/"(*) = /'(•*) +/CO- 

In general, the solution to any homogeneous linear differential equation is the exponential generating function of 
a sequence defined by a linear recurrence relation. 


Linear recurrence relations 


Suppose we have a sequence defined by the linear recurrence relation 
A n+ k = a 0 A n + O'! A „ + 1 + a 2 A n+2 + • • • + a k -\ A n+ k~\. This is linear and homogeneous, which means that for any 
two sequences \A n J and \B n ] satisfying the equation, so does the sequence {AA„ + pB n ). It is also determined 
entirely by the values of {Ao, A 1 , ..., A k \ j, so there are k degrees of freedom in the solution set. 

If a sequence { B„ = Jt"[ satisfies the recurrence relation, then we have x / 1 = a^ + a\ x + a 2 x 2 + ... + a k -\ x* -1 . By 
the fundamental theorem of algebra, we can rearrange and factorise this into k linear terms. Assuming that this 
polynomial has l distinct roots, Pi, Pi, ■■■, Pi, we obtain the general solution 
\A„ = P\{n) Pi" + P 2 (n) p 2 n + ••■ + F/(«) /?/'}. F, is a polynomial of degree m, - 1, where m, is the multiplicity of 
the root fp. Specifically, when all the roots are distinct, all values of P,(n) are constants. It is possible to verify 
that each term satisfies the linear recurrence relation, so the general solution is valid. Also, it has k degrees of 
freedom, so there can be no other solutions. 


Probably the simplest non-trivial linear recurrence relation is the Fibonacci sequence {F„} with Fq = 0, F { = 1 and 
F „ + 2 = F n+ 1 + F„. It was discovered by Leonardo of Pisa when contemplating a problem about the exponential 

growth of a rabbit population. It has the closed-form expression F n = ~~, where tp and ip are the positive and 

negative roots, respectively, of the equation x 2 = x + 1 . 


VT 


, where <j> ■ 


l+Vs” 


and ip - 


l-VT 


[Binet’s formula] 


This enables one to compute a closed form for the exponential generating function of {0, 1, 1, 2, 3, 5, 8 , ...], and 
thus find a closed-form solution to the differential equation f"(x) = f' (x) + f(x). 
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7. Find a closed form for the exponential generating function of {0, 1, 1, 2, 3, 5, 8 , ...). 

8 . Show that, for all a, b e Z + , we have F a \ F a b- 

9. Prove that F „ 2 = F n+l F„ t - (-1)". [Cassini’s identity] 

10. Show that the Fibonacci sequence has the ordinary generating function f a (x) = —-—-, and thus find a 

1 -x-xr 

rational approximation to 0.01010203050813213455. 

Every sequence generated by a linear recurrence relation has an ordinary generating function expressible as a ratio 
between two polynomials. 

Consider the sequence {A, (mod p)}. As the sequence is generated by the previous k terms, each of which can be 
one of n values, the sequence must eventually cycle with period P <n k . The recurrence relation is deterministic in 
both directions, so it must be completely periodic. Indeed, this bound can be slightly improved upon, as k consecu¬ 
tive zeros would result in a sequence that is identically zero. Hence, P <n k - 1. Sequences where equality holds 
are known as maximal. For example, the non-zero sequence with recurrence relation X n+ j, = X „ +1 + X n (mod 2) has 
a period of 2 3 - 1 and is displayed below (where red and blue discs indicate 0 and 1 , respectively). 


In a maximal sequence, every non-empty string of k digits appears precisely once in each cycle. Together with the 
property that the sequence has a long period, linear recurrence sequences are used as simple pseudo-random 
number generators. A refined algorithm, known as the Mersenne twister, has a period of 2 19937 - 1 and is almost 
indistinguishable from random data. 

11. Let n be an integer greater than 1. In a circular arrangement of n lamps L (h L t , ..., L„ t , each one of which 
can be either ON or OFF, we begin with the configuration where all lamps are ON. We carry out a sequence 
of steps, Step 0 , Step[, .... If Lj_\ is ON, then Step ; changes the status of L, (from ON to OFF or vice- 
versa), but does not change the status of any other lamps; if Lj_ | is OFF, then Step, does not change 
anything at all. Show that: 

■ There is a positive integer M(n) such that all lamps are ON again after M(n) steps. 

■ If n has the form 2 k , then all lamps are on after ir - 1 steps. 

■ If n has the form 2 k + 1. then all lamps are on after n 1 —n + 1 steps. [IMO 1993, Question 6 ] 


Positional systems 


When we write numbers in decimal notation, we let ‘<4<4-t ■ • ■ ^2 d\ do’ represent the non-negative integer 
d k 10 A + dk -1 10 A_1 + ... + \0d\ + do, where each digit d t e [0, 1, 2, 3, 4, 5, 6 , 7, 8 , 9[. Note that this is unique; 
an integer cannot have two valid representations. This can be generalised to other bases (or radices) such as 
binary and ternary. For example, we can uniquely express any non-negative integer in binary as a sum of distinct 
powers of two. 

■ The n-ary representation L d k d k _ l ... d 2 dido’, where each digit d, e (0, 1, ...,/;- 1), uniquely represents the integer 

E n'di. 

i =0 

A certain type of functional equation, known as a binary functional equation or Monk equation, is best 
approached by expressing the numbers in binary. 
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12. A function / : Z>o R is defined using the initial term /( 0) = 0 and the recurrence relations 
/(2 ri) = ^ /(«) and /(2 n + 1) = 1 + /(2 «). How many integers x exist such that x = 2 20 /(x)? 

There are some more interesting variants of positional systems. Balanced ternary has n = 3 and d, e {- 1 , 0, 1), as 
opposed to the {0, 1, 2j of ordinary base-3. There exist unique expressions in balanced ternary for every integer, 
as opposed to merely non-negative integers. By contrast with the modern binary computers we use today, there 
was an early computer (Setun), which operated in balanced ternary. 

The system with n = i - 1 and d, e {0, 1 j is even better, as it can represent any Gaussian integer. The Gaussian 
integers with representations using at most k + 1 digits are the 2 k+l points on a space-filling fractal known as the 
twindragon cun’e. The example for k = 8 is shown below, with the origin encircled red. 


© 


13. Prove that every Eisenstein integer has a unique representation in the positional system with base n = a>- 1 
and digits {0, 1, 2}. 



..v 


• AV.V 

. ... !»»>• :>>>>>>>>• 

; v.. ; v. A _; v.w.. _; y.y.y.v.. 

’ v v. . * ‘•"wwXwXw/Xw.. ' "•v/z/av/XvIvIvIvIv. . 

,v.v. . • V.V.V.V.V.V.V.V.V.V.V.*. . • V.V.V.V.V.V.V.V.V.V.V.*.. • v.v.v.w.v.v.v. 

•••••• •••••••••••••••••••••••• •••••••••••••••••••••••• ••••- 

'v.w.. ‘v.v.v.v.v.v.v.v.*.. ••••.. • v.v.v.v.v.v.v.v.*.. ••••.. 

. . .V.V.V... .V * . .v.v.v.v.v.v.v.v.v. .. .*.* • . .v.v.v.v.v.v.v.v.*.*. . . .V ■ 




A particularly interesting positional notation is the Zeckendorf representation. There is a unique way to express 
any non-negative integer as the sum of Fibonacci numbers, no two of which are consecutive. For example, 
100 = 89 + 8 + 3. 


14. Determine whether there exists a function / : N -» N such that /(/(«)) = /(«) + n for all n e RI. 


15. A set A of integers is called sum-full if A c A + A, i.e. each element a e A is the sum of some pair of (not 
necessarily distinct) elements b, c e A. A set A of integers is said to be zero-sum-free if 0 is the only integer 
that cannot be expressed as the sum of the elements of a finite non-empty subset of A. Does there exist a set 
of integers which is both sum-full and zero-sum-free? [EGMO 2012, Question 4, Dan Schwarz] 
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Catalan sequence 


We define the «th Catalan number, C„, to be the number of ways of pairing 2 n points on the circumference of a 
circle with n non-intersecting chords. For example, we have C 3 = 5: 



Each of these pairings divides the interior of the circle into n + 1 regions. We can place an ‘observer’ in any 
region. Now label each point with a L or R depending on whether it is connected to a point to the left or right of 
itself when viewed by the observer. For example, the first pairing can lead to any of the following labellings, 
depending on where the observer is positioned: 



As every labelling of 2 n points with n Ls and n Rs can result in a valid pairing and position of observer, and every 
pairing and position of observer results in a unique labelling, we have a bijection between the two. As there are C„ 
pairings and n + 1 regions in which to place the observer, there are (n + 1) C„ different labellings. However, there 

(2n\ 1 (2n 

are clearly I I different labellings, so we have C„ = — I 


16. Show that the number of valid balanced strings S of 2 n parentheses is given by C„. (There must be n left 
parentheses, n right parentheses, and for all k <2 n, the number of right parentheses in the first k symbols of 
S cannot exceed the number of left parentheses.) 

17. An insect walks on the integer lattice Z 2 , beginning at (0, 0). After 2 n steps, it reaches (n, ri). 

■ How many different paths could the insect have taken? 

■ Assuming that, for all points (x, y) on the path, x> y, how many different paths are possible? 

18. Prove that C n+ \ = Co C„ + C\ C„_i + C 2 C, 2 + ... + C„ Co- [Segner’s recurrence relation] 


19. Let c a (x) = Co + Ci x + C 2 x 2 + ... be the ordinary generating function for the Catalan numbers. Show that 
xc a (x) 2 - c 0 (x) + 1 = 0 , and thus that c a (x ) = 1 ^ | 1 ~ 4a . 


L-systems 


Suppose we have a string S consisting entirely of the symbols A and B. We define /'(.S') by simultaneously replac¬ 
ing every A with AB and every B with BA. For example, starting with a single A, we have /(A) - A B, 
/(/(A)) -ABBA, f 3 (A) = A B B A B A A B, and so on. The limit of this is an infinite sequence 
ABBABAABBAABABBA..., known as the Thue-Morse sequence. This process of repeatedly applying 
substitution rules to every symbol in a string is known as an L-system. 

20. A sequence jjq] is defined by = 1 and the recurrences X 2 k = - x^ and JC 2 a —1 = (-1)^ +1 *k (for all k e Z + ). 
Prove that, for all n > 1, we have +1 + x 2 + ... + x„ > 0. [IMO 2010 shortlist, Question A4] 
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21. Prove that we can divide the interval [0, 1] into finitely many intervals, alternately coloured red and blue, 
such that J. ed F(x) dx = P(x) dx for all polynomials P of degree 2013. 



One of the more interesting L-systems is the golden string, generated by iterating the system S -> g(S) with 
substitution rules XX Y and Y -» X to the initial string X. The first few iterations are 
X->XY->XYX-*XYXXY->XYXXYXYX-*.... 

22. Show that the number of symbols in g"(X) is given by F n +2, i-e- the (« + 2)th term of the Fibonacci 
sequence. 

23. Prove that g n (X) can be expressed as the concatenation of two palindromic substrings for all n e Z + . 
Moreover, find the lengths of the palindromic substrings for all n > 3. (A string is described as palindromic 
if it reads the same in both directions. For example, LEVEL and RAC EC AR are palindromic strings. 

The empty string is also considered to be palindromic.) 

24. Flence prove that, for all n e Z + , removing the last two symbols of g n (X) results in a palindromic string. 

The golden string may be regarded as a one-dimensional analogue of the Penrose tiling. Indeed, if you know 
where to look, you will be able to find the golden string recurring throughout any Penrose tiling. Additionally, the 
last digit of the Zeckendorf representation of n (for all non-negative integers n) forms the golden string. 


Farey sequences 


Suppose we have two tangent circles resting on the real line. Circle Y i is positioned at — and has a diameter of 
. Similarly, circle Ti is positioned at — and has a diameter of . 

9l 2 ?2 ? 2 - 


25. Prove that pi + 1 = P 2 <?i- 
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We now position a smaller circle, r 2 , externally tangent to the two larger circles and the real line. 

26. Show that T 2 is tangent to the line at — = P,+P2 . 

? l +?2 

This gives a geometrical relationship between the radii of the circles. 

■ Suppose circles T\ and T 2 are tangent to each other, and one of the outer common tangents is 1. Let be a third circle 

tangent internally to Ti, r 2 and I. Then —— = —-—r ——, where r,- is the radius of T,-, [Sangaku problem] 

VTT Vo - V7T 

If we begin with a circle of unit diameter for each positive integer, and iterate this process infinitely, we create a 
pattern known as the Ford circles. This has an elegant symmetry associated with modular forms and certain 
tilings of the hyperbolic plane. 



Taking only the circles where q < n and confining ourselves to the interval [0, 1], we generate a Farey sequence, 
F n . For example, F$ = {0, ^±^|^^, l}. Each term is the mediant of the two neighbouring terms. 

For instance, ^ is situated between ^ and |, and ^ = -j' “. This enables a Farey sequence to be extrapolated in 

both directions from two adjacent terms. It turns out that Rademacher’s proof of the formula for the partition 
numbers involves Ford circles and Farey sequences. 

Returning to the Sangaku problem, there is a generalisation known as Descartes’ theorem, named after the 
philosopher who said ‘cogito ergo sum’ and invented Cartesian coordinates. If we have four circles, which are 
pairwise externally tangent, then there is a quadratic relationship between the reciprocals of the radii. 

■ (—+—+—+ — ) =2 + -^ + —A [Descartes' theorem] 

Vn r 2 r 3 r 4 / V /i r, r, r 4 I 

— is thus the solution to a quadratic equation. If we choose the other root (and multiply by -1 to make it posi- 

r 4 

tive), we obtain the reciprocal of the radius of the circumscribed circle, rather than the inscribed circle. 


Egyptian fractions 


The Fibonacci sequence, Catalan sequence and powers of two grow reasonably quickly, namely exponentially. By 
comparison, Sylvester’s sequence grows even more quickly ( doubly-exponentially ), with the first few terms being 
{2, 3, 7, 43, 1807, 3263443, ... j. This is defined with the initial term sq = 2 together with the recurrence relation 
s n = 1 +S 0 Sl s 2 ...S„-l. 
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27. Prove that the terms in Sylvester’s sequence are pairwise coprime. 

This is a direct proof that there are infinitely many primes, as no two terms in Sylvester’s sequence share a prime 
factor. Euclid’s proof of the infinitude of primes is similar, but with a proof by contradiction instead. 

28. Show that — + — + — + ... = 1. 

*0 5 1 s 2 

As decimal expansions, continued fractions and ratios had not been invented, the ancient Egyptians expressed 
fractions as the sum of reciprocals of distinct positive integers. It is a remarkable fact that Egyptian fractions can 
represent any positive rational number. One algorithm which proves the possibility of this is the following: 

• Initially express - as - H-1-H... + -. 

b b b b b 

• If there are multiple copies of -, replace one of them with — H--—. 

1 1 n n +1 n(n+ 1 ) 

• Repeat the previous step until all unit fractions are distinct. 

As arbitrarily large fractions can be generated in this manner, we have a proof that the series 

I) = 1 + ^ ^ + ... diverges to infinity. It does so rather slowly, with the first n terms tending to ln(n) + y, 

where y is the Euler-Mascheroni constant. Also, we have yet another proof of the infinitude of primes, because 
we can factorise £(1) as (l + - + - + - + ...) (l + - + - + ...) (l + - + +...).... Each term is the sum of a 

geometric series, resulting in the product expansion f(l) = ’ ■■■■ As eac h °f the terms is finite, 

but the product is infinite, there must be infinitely many primes. 

Another algorithm for generating Egyptian fraction expansions is the greedy algorithm, where we choose the 

4 

largest unused unit fraction less than or equal to the remainder. For example, if we wanted to express - as an 
Egyptian fraction, we would first subtract ^, resulting in , followed by ^, resulting in A, and finally -A, 
resulting in the expansion ^ ^ + - 1 

If, instead, we restrict ourselves to choosing the largest odd unit fraction at each point, the process may continue 
forever. For example, applying this algorithm to ^ generates the remainder of Sylvester’s sequence. This is 

similarly the case for all fractions with even denominators. It is an open problem as to whether the process 
necessarily terminates for all fractions with odd denominators. 


Fermat numbers 


Another doubly-exponential sequence is the sequence of Fermat numbers of the form 2 1 " + 1, namely 
{3, 5, 17, 257, 65537, ...i. You may notice that the first five Fermat numbers are prime, known as Fermat 
primes. Fermat conjectured that all Fermat numbers are prime, with the first counter-example found by Euler. 

■ 2 25 + 1 = 4 294 967 297 = 641 • 6 700 417. [Euler’s factorisation] 

It is unknown whether there are infinitely many Fermat primes. Only the first five are known to be prime; the next 
28 have been proved to be composite! 

29. Prove that if 2 k + 1 is prime, then it is a Fermat prime. 

A regular «-gon of unit side length is constructible if and only if it can be constructed using a compass and 
straightedge. Equivalently, this means that the Cartesian coordinates of each of the vertices can be expressed as a 
finite combination of integers together with the operations {+, -, x, -r, V~}■ Gauss proved that a regular «-gon 
is constructible if and only if n = 2 a p\ p 2 ... Pk, where each p, is a distinct Fermat prime. The only known odd 
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values for n are thus products of distinct Fermat primes. Expressed in binary, these form the first 32 rows of 
Pascal’s triangle modulo 2, namely |1, 11, 101, 1111, 10001, ...1. When each 1 is replaced with a dot and 0 is 
replaced with an empty space, this is the fifth-order approximation to the Sierpinski triangle. 


Certain geometrical constructions, such as trisecting the angle, are possible with paper folding but not with 
Euclidean constructions. Using origami, we can apply the operations {+, -, x, H-, yf ~, } to Cartesian 

coordinates, and thus reach points unattainable with compass and straightedge alone. In this new system, a regular 
n-gon is constructible if and only if n = 2° 3* q\ q 2 ■■■ qk , where each q { is a distinct Pierpont prime (prime 
expressible in the form 2" 3'" + 1). Fermat primes are, by definition, a subset of Pierpont primes. 
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Solutions 


i. 


2 . 


3. 


4. 


5. 


6 . 


7. 


8 . 


9. 

10 . 


11 . 


We have a B (x) + b 0 (x) = Aq + Bq + A\ x + B\ x + ■ ■ ■. Similarly, 

a 0 (x) b 0 (x) = (Ao + A 1 x + A 2 x 2 + ...) (Bo + B t x + B 2 x 2 + ...). Expanding the brackets results in 
Aq Bq + (Aq B\ + A\ Bo) x + (Ao B 2 + A\B\ + A 2 Bo) x 2 + — 


The geometric series 1 + x + x~ + ... is given by -—-. As {1, 2, 3, ... j is the convolution of {1, 1, 1, ...} with 


itself, its ordinary generating function is 


(i-rf 


. The convolution of this with {1, 1, 1, ...} gives the 


triangular numbers with ordinary generating function 


( i -*) 3 


Differentiating each term, we obtain y a 0 (x) = Ai + 2 A 2 x + 3 A 3 x 2 + 4 A 4 x 3 + .... This is the ordinary 
generating function of jAi, 2 A 2 , 3 A 3 , ...i. 


We already know that y- = 1 + x + x 2 + x 3 + .... This can be integrated to yield c + x + ^ x 2 + ^ x 3 + .... As 
ln( 1 - x) evaluates to zero when x is zero, the constant term is zero. Hence, ln( 1 - x) is the ordinary 
generating function of { 0 , 1 , y y y ...}. 


Shift the sequence two places to the left and add it to the original sequence. This results in the zero 
sequence, the EGF of which is the zero function. 


This has the recurrence relation F n+ 2 = F n+ \ + F n . A possible solution is the Fibonacci sequence, 
{0, 1, 1, 2, 3, 5, 8 , ...!. 


fe(x) = ~b( l+< f ,x + 


vH 


VT 
4*. _ gfixy 


(<M 

2 ! 


3 ! 


,")__L( i +(Ax + 


OA-t) 2 

2 ! 


((A A 3 


...j. This is equal to 


The ratio y k ^ = (p a( - b + <fi a( ' b 2) ip a + ... + <p a i// a ^ b 2) + ij/ a ^ b is a symmetric polynomial in tfi and 

if/, so is expressible as a polynomial with integer coefficients in 0 if/ and tf> + 1 //. As the elementary symmetric 
polynomials are themselves integers, so is the ratio —. 

F a 

(r-r? _ _ 0 2 "+< a 2 "+ 3 _ _ 

(0-W 2 (0-W 2 “ 5 5 K<PV) K ) ■ 


f 0 (x) = x + x 2 + 2 x 3 + 3 x 4 + 5 x 5 + ■ ■ ■. It is straightforward to verify from the recurrence relation that 
fo(x) = x + x f 0 (x) + x 2 f 0 (x). Rearranging, we obtain the closed form f a (x) = — . Setting x = 0.01 gives 

1 -x-x 2 


,, . , • U.Ui U.U1 1UU 

us the rational approximation-=-- = —— 

1 - 0 . 01 - 0.0001 0.9899 9899 


This system is deterministic and finite, so must necessarily eventually become cyclic. As it is reversible as 
well, it must be completely cyclic; this solves the first part of the question. We can represent the state of the 
system using the ordinary generating function L„_ 1 + 2 x + L „_3 x 2 + L „_4 x 3 + ... + Lq x n ~ ] , where 0 and 

1 correspond to OFF and ON, respectively. This is a polynomial ^ 2 [x], as each coefficient can be either 0 or 
1 and we consider addition modulo 2. If we follow each step with a rotation to the left (such that L, moves 
into the position of L,-_ 1 ), then we only need to alter the state of Lq depending on L„ \. Refer to this 
composite operation as Step*. The lamp alteration is equivalent to the operation P(x) -> P(x) - 1 + x n ~ l (if 
applicable, or the identity function otherwise), and the rotation is equivalent to P(x) -> xP(x). So, Step* 
performs the polynomial operation P(x) -> xP(x) (mod x” + F !_1 + l); this modulus is the characteristic 
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polynomial associated with this linear feedback shift register. If we apply m steps in succession, then we 
obtain the polynomial x m P(x) (mod x n + x " -1 + 1). If m is the period of the system, then x"' P(x) = P(x) and 

thus x! n = 1. For the second part of the question, we get x " 2 = (x")" = (x” _1 + l)". As n = 2 k , all binomial 


coefficients apart from 


(?) 


= 1 and 


= 1 are even (consider Pascal’s triangle modulo 2, which 


approximates the Sierpinski triangle), thus equal to zero in the field Z 2 . So, (x" 1 + l)“ expands to give 

x" 2- " + 1. With the identities x " 2 = x” 2- " + 1 and x " 2 = x” 2 ~" + x” 2-1 , we obtain x " 2-1 = 1, and thus (Step *)” 2-1 
is the identity function. For the third part of the question, we get 

x " 2-1 = (x ” +1 )” -1 = (x' ,+1 ) 2i = (x" + xf = x "^ -11 + x ” -1 = x" 2- " + x”— 1 . Hence, 

x " 2 = x ” 2-1 + x" 2- " = x " -1 + 2 x” 2- " = x" -1 . Dividing throughout by x " -1 results in x ” 2- ” +1 = 1, so 

(Step *)" 2- ” +1 is the identity function. 


12. The function ‘reflects’ the binary representation. For example, it maps 10011 101 to 1.0111001. 
Multiplying by 2 20 means that it is subsequently shifted 20 places to the left. The fixed points of this 
function are the palindromes of length 21 , of which there are 2 11 . 



j-*-,-.-,- 1 -,-4-1-„-1-L 

-4 -2 


J-*-,- 4 -1-1-L 

2 4 




13. Colour the Eisenstein integers red, green and blue, as in the diagram above. If we multiply all Eisenstein 
integers by o> - 1, we obtain the red Eisenstein integers (a copy of the Eisenstein integers rotated by ^ n and 

scaled by ). The green and blue Eisenstein integers can be obtained by translating the red Eisenstein 
integers by 1 and 2, respectively. Hence, if no Eisenstein integer has two representations using k digits, then 
no Eisenstein integer has two representations using k + 1 digits. By induction, all representations are 
unique. However, we have yet to show that all Eisenstein integers can be obtained in this manner. If all 
Eisenstein integers within the hexagon with vertices {l, -or, o>, -1, or, -cu) can be represented (which can 
be shown using trial and error), then we can represent all Eisenstein integers within the hexagon with 
vertices (l (u> - 1 ), -oj 2 {o> - 1 ), lj(lj - 1 ), -1 (u> - 1 ), cj 2 (o> - 1 ), -a>(u> - 1 )} by multiplying those 
Eisenstein integers by oj - 1 and translating by 1 and 2 to fill in the gaps. By induction, we can represent all 
Eisenstein integers within an arbitrarily large hexagon centred about the origin, and therefore all Eisenstein 
integers. 


14. Yes, consider the function which shifts the Zeckendorf representation of a positive integer one position to 
the left. Due to the Fibonacci recurrence relation, /(/(«)) = /(«) + n. 
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15. Yes, for example {1, -2, 3, -5, 8, -13, 21, .This is sum-full as each term is the sum of the next two 
terms. To prove that it is zero-sum-free, consider all numbers expressible as the sum of the first k terms of 
the sequence. We can prove from a trivial base case and simple inductive argument that this is 
{1 - Fk+ 1 , 2 - Fk+ 1 , ..., -2, -1, 1,2, ..., Fk +2 - 2, Fk +2 - 11 for odd k, and the negation thereof for even k. 
The limiting set is the set of nonzero integers. 



16. We can biject between these strings and non-intersecting pairings of points on a circle by cutting the circle 
at a given point and 'unfolding' it, as demonstrated above. 


17. The insect can only move right or up at each step, as otherwise it would take too long to reach («, n). There 

"2 n 


must be n moves to the right and n moves up, so there are 


(„) 


possible paths in the first part of the 


problem. For the second part of the problem, we represent a horizontal move with a left parenthesis and a 
vertical move with a right parenthesis. This reduces the problem to the previous question, so there are 
' 2 n ' 


C = — 

n n +1 


- ( 2n \ 
+1 l n ) 


paths with this constraint. 


18. Consider 2 n + 2 points on the circumference of a circle, and label one vertex A. Choose another vertex B 
such that the number of vertices on each arc A B is even, and join A and B with a chord. Let the number of 
points right of the chord be 2 k\ the number of points left of the chord must be 2 (« - k). There are Q non¬ 
intersecting pairings of the vertices to the left of the chord, and C„_£ non-intersecting pairings of vertices to 
the right of the chord, giving a total of Q C„_/. pairings. Repeating this for each location of B gives 
Cq C„ + C\ C„_ 1 + ...+ C„ Co possible pairings. This must be equal to C„+ 1 , by definition. 


19. x c 0 (x)^ — x(^Cq + Ci x + C 2 v” + ... j — Co x + (Co C\ + C\ Co) x~ + (Co C 2 + C\ C\ + C 2 Co) x* + ..., which 
simplifies to Ci x + C 2 x 2 + ... = c 0 (x) - Co = c 0 (x) - 1 by Segner’s recurrence relation. So, 
x c 0 (x ) 2 - c 0 (x) +1=0, and we can obtain c 0 (x) by the Babylonian formula for the roots of a quadratic 

1 ±J (_i)2_4 x 

equation. Specifically, we have c 0 (x) = —— -—. As c 0 (x ) -> 1 when x -» 0, the correct root is 


c 0 (x) = 


1 -V 1—4 JT 
2x 


20. We define a sequence {y,-j such that: 

■ yi = A if k is even and x k = I; 

■ yi = B if k is odd and x k = 1: 

■ y,- = C i f k is even and x k = -1; 

■ y t = D if k is odd and x k = -1. 

Consider the string Y] obtained by concatenating the first 2 1 elements of |y,j. It is straightforward to verify 
that Vo = B, and that T /+1 can be obtained from 7/ by applying the L-system with substitution rules: 

T I) C, B -> B C, C -r R A, and D -* DA. The first few terms of { Yj\ are 

B^>BC-*BCBA-^>BCBABCDC-> .... Interpret these strings as sequences of instructions for moving 
an insect on the integer lattice Z 2 : 

■ A: move east; 

■ B: move north; 

■ C: move west; 
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m D\ move south; 

The pattern generated by this is a space-filling curve bounded by the lines y = —x and x = 0 (with the first 
few iterations displayed below). The substitution rules effectively reflect the path in a NNW line through 
the origin, combined with a dilation of scale factor \p2 . Hence, the insect always remains in the octant 
bounded by the lines y = -x and x = Q. Remaining to the right of y = —x is equivalent to the condition 
a n +b n > c n + d„ (for all n e Z + ), where a„ = | {k : y^= A] |, et cetera. This is in turn equivalent to the 
desired inequality. 



21. We can scale the set of intervals without affecting anything, so let’s colour [-1, 1] instead. Note that if we 
have a set of monic polynomials of degrees 0, 1, ...,«, inclusive, which satisfy the condition 
(Jred^ > ( x ) ^ x= .L P M d x )> then the condition holds for all linear combinations of them (namely all 
polynomials of degree < n). If we have a colouring that is symmetric (i.e. x is coloured identically to —x for 
all x g [-1, 1]), then all odd functions satisfy the condition. Similarly, if we have a colouring that is 
antisymmetric (i.e. x is coloured oppositely to —x for all x e [-1, 1]), then all even functions satisfy the 
condition. Begin with the colouring Co, where the interval [-1, 0] is red and [0, 1] is blue. Clearly, all 
constant functions (polynomials of degree 0) satisfy the condition. Let C* denote the complement of C, 
where an interval is red in C* if and only if it is blue in C. We can scale Co to the interval [-1, 0], and scale 
Co* to the interval [0, 1], and append them to form the colouring C[. As Co works for all degree-0 
polynomials, so must the scaled copies of Co and Co*, and thus also C[. As Ci is symmetric, it must also 
work for the function y = x, and therefore all linear polynomials. We define C 2 by performing this operation 
on Ci, resulting in an antisymmetric colouring which must also satisfy y = x 2 , and therefore all quadratics. 
Continuing in this manner, we obtain C2013 which works for all degree-2013 polynomials. Note that this 
colouring is related to the Thue-Morse sequence. 


22. Observe that g(g(X)) = X Y X is the concatenation of g(X) = X Y followed by X. Hence, g n+2 (X) is the 
concatenation of g" +1 (X) followed by g n (X), and thus | g" +2 (X) | = | g n+1 (X) | + | g'\X) |. This is the 
recurrence relation for the Fibonacci sequence. 


23. As opposed to considering strings of symbols, consider instead beads on a necklace (positioned at the kth 
roots of unity, where k is the length of the string). The substitution rules X X Y and Y -» X are then 
equivalent to the alternative rules X -»( Y ) and Y -*(), where we consider { and ) to be ‘half-beads’, where 
)( = X, followed by a rotation of the entire necklace by one half-bead to restore the correct orientation. As 
the substitution rules map each symbol to a palindromic string, and the initial necklace has bilateral 
symmetry, then all subsequent necklaces have bilateral symmetry. We then reflect the ‘start’ of the necklace 
in this axis of symmetry to produce another position. Cutting at these two positions will clearly result in two 
palindromic strings. Using this idea of the half-bead substitution (which preserves the axis of symmetry) 
followed by a rotation, it is straightforward to show that g n (X) has palindromic substrings of length 
F n +1 - 2 and F n + 2, where F n is the nth Fibonacci number. 
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24. We already proved, in the previous exercise, that the first F n+ 2 - 2 symbols of g" +l (X) is a palindrome. As 
we obtain g n (X) by taking the first F n , 2 symbols of the infinite golden string, g"(W), the first F n+ 2 - 2 
symbols of g n (X) also form a palindrome. 


25. Consider the trapezium formed by the centres of the circles and the points of tangency with the real line. 
The hypotenuse has length - (-^ + -^ ], the base has length — - — and the difference between the left and 

2 W q 2 2 ) 92 9l 

right heights is ± - ( - - - I. Applying Pythagoras’ theorem, we obtain 

2 \qi 9 2 2 I 

- [— + = (— - —) + - ; —M . Expanding, this results in ) = 

4 I 9, 2 qF) I 92 91 / 4 \q 2 q 2 ) F £ ’ V 92 9 l / 


so —-= 


9i 92 


1 

9l 92 ' 


Multiplying by z/i 172 gives the identity f/i _ Pi <72 = 1. 


26. We have three equations, namely p 2 q\ - p 1 <72 - 1. P 3 _ Pi 93 = 1. an d P 2 93 - P 3 92 = 1. Subtracting the 
third equation from the second gives us ( p\ + P 2 ) 93 = (91 + 92 ) P 3 , which rearranges to give — = ■ 


93 


9i+92 


27. Let j > i. Then sj =k s,■ + 1, so sj = 1 (mod si). By applying Euclid’s algorithm, j ( - and sj have a greatest 
common divisor of 1 . 


28. By reverse-engineering the definition, we get the recurrence relation ,v „ + 1 - 1 = s„(s,, - 1). Assume that 


^0 S 1 s k 

by induction, this tends towards 1 . 


^. Adding the next reciprocal would give us 


■Sfc+i - ! 


s k +1 


s k+ 2“1 


So, 


29. Assume that k is not a power of two, so k = l p for some odd prime p and integer /. Then, 
( 2 ! + l) (l -2 1 + 2 21 - ... + = 2 k + 1 , thus proving that 2 k + 1 is composite. 
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Inequalities 

Sums of squares 


Over the complex numbers, every polynomial has at least one root by the fundamental theorem of algebra. Over 
the reals, however, it is possible to define polynomials that are always greater than (or equal to) zero. These are 
known as positive (semi)definite functions. One such example is x 1 + y 2 > 0, which is true for all x, y e IR. In 
general, as squares of real numbers are non-negative, sums of squares are also non-negative. This is the most 
basic useful inequality. 

■ If jci , X 2 , ..., x„ e IR and ay, a 2 , ..., a n > 0, then aq xi 2 + aq X 2 2 + ... + a n x„ 2 > 0, with equality if and only if 

xi 2 = X 2 2 = ... = x„ 2 = 0. [Sum of squares inequality] 

Artin proved Hilbert’s seventeenth problem, namely that every positive semidefinite polynomial (and, by exten¬ 
sion, rational function) can be expressed as the sum of squares of rational functions. Charles Delzell later devel¬ 
oped an algorithm to do so. Hence, it is theoretically possible to prove any inequality involving rational functions 
simply by reducing it to the sum of squares inequality. However, this approach is similar in its impracticality to 
building an automobile using Stone Age tools. Certainly, it is impossible in the 270 minutes allocated in the 
International Mathematical Olympiad. Nevertheless, we can still tackle some basic inequalities in this way, 
especially if they are expressible as the sums of squares of polynomials. 

1. Prove that x 2 + y 2 + z 2 > x y + y z + z x. 


Jensen’s inequality 


According to Ross Atkins, “lensen’s inequality is greater than or equal to all other inequalities”. This strongly 
indicates that it is advisable to assimilate it into one’s problem-solving repertoire. It is geometrically very obvi¬ 
ous, namely that the barycentre of a convex figure is located inside it. This makes it all the more remarkable that 
so many useful inequalities, such as the power means inequality, are trivialised by Jensen’s inequality. 

■ A continuous function / is convex over an interval ( a , b) if, for all xi, X2 e (a, b ) and aq, 02 £ [ 0 , 1 ] such that 
o'! + q - 2 = 1 , we have f(a 1 xi + a 2 X2) < aq f(x\) + a 2 f(x 2). If the reverse inequality holds instead, the function is 
concave. [Definition of convexity] 

This is most easily represented with the aid of a diagram: 



For any two points X and Y on the curve of a convex function, any point A on the line segment X Y lies above the 
curve. The Australian IMO team leader, Ivan Guo, created a mnemonic for remembering the shapes of generic 
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convex and concave functions: 

• Ivan: “Concave looks like a cave, and convex looks like a vex.” 

• Someone else: “What’s a vex?” 

• Ivan: “An upside-down cave.” 

2. Let / be a convex function over the interval (a, b). Let {x\, xo, ..., x„] c (a, b) and Icq, tq, ..., a „j c [0, 1] 
such that cq + cq + ... + a„ = 1. Show that 

f(a i x\ + ct 2 X 2 + ... + a n x n ) < O'| f[x\) + a 2 /(.q) + ... + a n f(x n ). [Weighted Jensen’s inequality] 

Observe that the n = 2 case of the weighted Jensen inequality is just the definition of convexity. It is often quoted 
as the slightly less general (but asymptotically equivalent) theorem where <q = cq = ... = a n = -. 

■ Let / be a convex function over the interval (a, b), and let [xi, x 2 , ..., x n j c (a, b). Then 
/(- (q + x 2 + ... + x„)) < - (/(xi) + /(x 2 ) + ... + /(x„)). [Jensen’s inequality] 


3. 

4. 


If {jei, a' 2 , ..., x„) are all positive, show that - (xi + + ... + x„) > Vx i x 2 ... x„ . [AM-GM inequality] 


If a and b are two non-zero real numbers such that a>b, show that 


■I j (xi a + x 2 a + ... + x „ a ) > J j (x\ b + X 2 b + ... + x b ) . [Power means inequality] 


The arithmetic mean, quadratic mean and harmonic mean arise when a is 1,2 and -1, respectively. The geomet¬ 
ric mean is the limit as a -> 0. 


Muirhead’s inequality 


Muirhead’s inequality is a powerful generalisation of the AM-GM inequality. Before we can define it, however, it 
is necessary to introduce the idea of majorisation. 

■ Let ai + a2 + ■■■+ a„ = 1 and b\ + bi + ... + b n = 1 , and all a,- e [ 0 , 1 ] and b t e [ 0 . 1 J. Assume further that the 

sequences are ordered such that ai > a 2 > ... > a„ and bi > Z > 2 > ... > b n . Then {a,} majorises {bj} if and only if 

+ 02 + ■ ■ ■ + a k > b\ + fc 2 + ... + b k for all k e [ 1 , n], [Definition of majorisation] 

The sequence (4, 0, 0, 0), for example, majorises (1, L 1, 1), as they are sorted into descending order and the 
following inequalities hold: 

• 4> 1; 

• 4 + 0 > 1 + 1; 

• 4 + 0 + 0>l + l + l; 

• 4 + 0 + 0 + 0= 1 + 1 + 1 + 1. 

Occasionally, the notation (4, 0, 0, 0) 5= (1, 1, 1, 1) is used to denote this relationship. Majorisation may appear at 
first to be a contrived relation, although it has several equivalent and more enlightening formulations. We inter¬ 
pret a_ = (a |, <q, ..., a n ) as a vector in IR", and consider the set of n ! (not necessarily distinct) vectors obtained by 
permuting the elements of the vector a. They all lie in the (n - l)-dimensional plane with equation 
X\ + X 2 + ■■■ + x n = a\ + a 2 + ■■■ + a n , and form the vertices of a permutation polytope. In general (when all 
elements are distinct), the three-variable case is a hexagon, whereas the four-variable case is a truncated 
octahedron. 
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4 


The red and blue hexagons correspond to the sets {4, 2, 0[ and {3, 2, 1}, respectively. The condition that the red 
hexagon contains the blue hexagon is equivalent to |4, 2, 0} |3, 2, 1}, which in turn is equivalent to the Birkhoff- 

von Neumann theorem : (3, 2, 1) can be expressed as a weighted average of permutations of (4, 2, 0). More subtly, 
this also implies that, for all x, y, z > 0, the polynomial x 4 y 2 z° + z 4 y 2 x° + y 4 z 2 x° + x 4 z 2 y° + z 4 x 2 y° + y 4 x 2 z° 
is greater than or equal to x 3 y 1 z l + z? y 2 x 1 + y 3 z 2 x 1 + x 3 z 2 y 1 + z? x 2 y 1 + y 3 x 2 z 1 ; a fact known as Muirhead’s 
inequality. 

■ Let {xi, X2, ..., x„), (o'!, a 2 , ..., a n | and {/?!, yS 2 , ..., yS„) be sequences of non-negative real numbers. If 

{«i, a 2 , ..., a,,} majorises (ySj, fl 2 , ..., /?„}, then 2 (xi“> X2 “ 2 ... x n a ") > E x 2 ^ 2 ... x n ^. The sigmas denote 

sym sym 

symmetric sums, i.e. sums over all n ! permutations of (xi, x 2 , ..., x„). [Muirhead’s inequality] 

It is discussed in https://nrich.maths.org/discus/messages/67613/Muirhead-69859.pdf. Geoff Smith described how 
Muirhead’s inequality is not well known amongst members of the IMO jury; occasionally certain inequalities, 
which were highly amenable to attack by this method, appeared on the IMO as a result of this. 

5. Prove that, for all positive real numbers x, y and z, we have 

2 x 3 + 2 y 3 + 2 z } >x 2 y + y 2 x+y 2 z + z 2 y + z 2 x + x 2 z. 


Majorisation as fluid transfer 


We have already defined majorisation in terms of decreasing sequences and permutation polytopes. A third 
interpretation involves containers of fluid. In each configuration below, the total volume of fluid is 1 unit; we will 
assume this without loss of generality to simplify things. 




Suppose we have a sequence of containers of fluid, such that if container X is immediately to the left of container 
Y, then X contains at least as much fluid as Y. We are allowed to siphon fluid from X to Y as long as this weak 
inequality is maintained. From the configuration above, we can siphon up to 0.175 units of fluid from the first 
container to the second one without breaking the weak inequality. In the diagram below, we have transferred 0.1 
units. 
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This is known as a valid q-move, where q = 0.1 is the amount of fluid transferred. We can continue in this man¬ 
ner. The fluid transfer lemma states that we can get from an initial sequence So to (arbitrarily close to) a target 
sequence S 0J by applying valid 17 -moves if and only if So majorises S, 0 . A more formal definition follows: 

■ Suppose So and S u = {b\, £> 2 , ■ • b„) are two weakly decreasing sequences of non-negative real numbers, each with 
unit sum and length n. Let e > 0 be a small real number. Define a valid q-move to be an operation 
(«!, •••> a k > a k+i > • • •) a„) -> {at, 02, ..., a k - q, a k+1 + q, ..., a n ) such that the sequence remains strictly 

decreasing and still majorises S w . Then there exists some h <sc e such that there exists a finite sequence of N valid 8 - 
moves So -> Si S N such that each term of S N differs by the corresponding term of S u by at most e if and only if 

So majorises S, ,. [Fluid transfer lemma] 


Proof: 

The ‘only if part is much easier, as it is evident that So majorises Si, which in turn majorises So. By induction, So 
majorises Sjy. If So does not majorise S 01 , then one of the weak inequalities must be broken by an amount h. If we 
let e < -, then S N must be sufficiently close to S (JJ to also break one of those inequalities. Hence, So does not 
majorise Sy, so we have a contradiction. 

For the ‘if’ part, note that there are a finite number of attainable configurations for a given So and 5, and the 
process cannot cycle, so must eventually terminate. Suppose we perform valid Amoves arbitrarily until we reach a 
position Sjv where no further valid Amoves are possible. 

By definition, for each pair of adjacent elements (a,-, a,+ 1 ) in Ay, it must be the case that either: 

• aj - <y, , i < 2 6 (in which case applying a <5-move would break the weakly decreasing criterion); 

• or {a\ + ai + ... + a,) - (bi + £>2 + • ■ • + bd < 6 (in which case applying a Amove would break the majorisation 
criterion). 

If the first case applies to all pairs of adjacent elements, we have a 1 - a n < 2 (n - 1) 6 . So, each element must be 
within 2 (« - 1) 6 of the mean, -. As S /V majorises S I:J , the same must be true of S lo . Hence, corresponding ele¬ 
ments can differ by no more than 4 (n - 1)6, which we can make smaller than e by letting 6 be sufficiently small. 
This leaves the alternative case where there exists some i such that 0 < («i + 02 + ... + a,) - (b\ + • + bi) < 6 

. In that case, we can split the problem into two separate problems: one involving the first i elements of the 
sequences, and the other involving the last n - i. (We need not worry that the sum of the first i elements of .Sjy is 
slightly greater than that of S w , as we can make the difference arbitrarily small. It is not important that things are 
exact, as long as the largest accumulative error is smaller than e.) By inducting on the number of elements, we 
prove the fluid transfer lemma. 
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Returning to the geometric interpretation, this means that we can incrementally move the vertices of the larger 
polytope inwards (varying two coordinates of any vertex at any one time whilst preserving the full symmetry 
group) until it becomes arbitrarily close to ‘suffocating’ the smaller polytope. This is rather intuitive, and implies 
the Birkhoff-von Neumann theorem. 


Energy minimisation lemma 


A corollary of this lemma is the energy minimisation lemma. The proof relies on concepts from real analysis such 
as continuity and convergence, which are taught in most undergraduate maths degrees (such as the Cambridge 
Mathematical Tripos). 

■ Suppose we have a continuous function E : IR" -> R, known as the energy function. Suppose that applying a valid q- 
move to S = {ai, 02. •a„) cannot increase the value of E(S). If we have two sequences So and S w , such that So 
majorises S I:J . then E(So) > EJS,,,). [Energy minimisation lemma] 

Effectively, we associate an ‘energy function’ with the configuration of containers, such that the energy either 
remains constant or decreases whenever a valid 17 -move is applied. The energy minimisation lemma states that 
E(Sq) 2 : £(S W ) if S 0 majorises S w . 

Proof: 

Due to the fluid transfer lemma, we can apply valid (/-moves to S„ to result in a new sequence (S n+ \) where each 
term differs from S u by at most e = e~ n . Starting from So, we produce an infinite sequence of sequences 
{So, Si, ...[ where each term is an increasingly close approximation to S 01 . More specifically, this sequence of 
sequences converges to S w . As £ is a continuous function, this means that {E(Sq), E(S\), ...} must converge to 
£(S W ). Also, as valid g-moves cannot increase the value of E(S), we have E(.S'o) > E(S 1 ) > ...; by the monotone 
convergence theorem, this means £(S W ) is the infimum of these terms, and therefore no larger than any of them. 
The result then follows. 


Generalised Muirhead inequality 


Using the lemmas developed above, it is straightforward to prove the generalised Muirhead inequality. 

6. Let / : (a, b) -> R be a convex continuous function. Let a \ > [3 \ > /T 2 oq 2 0, such that 
ai + ai = pi + P 2 = 1, and let {xi, X 2 I c (a, b). Prove that 

/(ai x\ + ctn xi) + f(a 1 X2 + ct2 xi) > f(P i Xi + P2 *2) + f(Pi x 2 + Pi x\). [Generalised Muirhead’s 
inequality for 2 variables] 

7. Let / : (a, b) -> R be a convex continuous function, and let {a-i, qt 2 , a «! be a weakly decreasing 
sequence of non-negative reals with unit sum. Let {xi, X 2 , .... x„] c (a, b). Define the function 

g(a i, or 2 , • ■ a n , xi, X 2 , ..., x„) = 2 /(cr i x^d + a2 x^) + ... + a n Xcr(n)), where the sum is taken over all 

sym 

n ! permutations <r of {1, 2, ...,«[. Prove that applying a valid (/-move to {a t , 0:2, ..., a n ] cannot cause g to 
increase. 


8 . Let / be a convex continuous function over (a , b) and {x 1; X 2 , ..., x„[ c (a, b). Let {(iq, 02, ..., a n ! an d 
\p\, P2, ..., p„ 1 be weakly decreasing sequences of non-negative reals, each with unit sum. Lurther, the 
former sequence majorises the latter. Prove that 

2 f(ai X <T(\) + a 2 Xo-(2) + ... +a n x (T(n} ) > 2 f(P 1 x a(\) + Pi x a(i) + ■■■+ Pn Mn)), where the sums are taken 

sym sym 

over all n ! permutations <x of {1, 2, ...,«}. [Generalised Muirhead’s inequality] 

We can derive the ordinary Muirhead’s inequality by letting /(x) = e x . Similarly, Jensen’s inequality follows from 
using the sequences [1, 0, ..., 0[ 5 = {-, -, ..., -}. This idea of inequalities generalising other inequalities gives a 
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hierarchy: 



Karamata inequality 


Consider the generalised Muirhead inequality. If we let {xj, X 2 , ■■■, x„j = jl, 0, 0, 0}, then we obtain the 

Karamata inequality as a special case. 

■ Suppose {a,-} majorises {£>,}, and / is a convex function. Then f(a\) + ... + f(a n ) > f(bi) + ... + f(b n ). [Karamata 
inequality] 

This can be extended in another direction. We can assume without loss of generality that £a ( - = Yfri = 1. Effec¬ 
tively, we can consider two new functions, g'(x) =na^ xn -\ and h'(x) = nb\ xn - 1 , which are defined on the open 
interval (0, 1). As {a,} majorises {b,} and the sequences are sorted in descending order, we have that g' and h' are 

k k 

weakly decreasing and fg’(x)dx> fh'(x)dx for all 0 < k < 1. We represent these integrals by g(x) and h(x), 
o o 

respectively. It is clear that g(0) = h( 0) = 0 and g(l) =h( 1) = 1. 



The graph of g ' (x) is a collection of n rectangles of decreasing height. Integrating this to obtain g(x) results in a 
concave line formed from n straight line segments of decreasing gradient. If we take the limit as n tends towards 
infinity, the sequences in Karamata’s inequality are replaced with arbitrary non-negative decreasing functions, g' 
and h '. 
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Suppose g and h are increasing concave functions with domain [0. 1J such that g(0) = h{ 0) = 0, g(l) = /t(l) = 1 and 
g(k) > h(k) for all k e [0, 1J. The derivatives of g(x) and h(x) with respect to x are denoted g' (x) and h' (x), 

l l 

respectively. Let / be an arbitrary convex function. Then ff(g' (x)) dx > ff(h' (x)) dx. [Continuous Karamata 

o o 

inequality] 


Schur’s inequality 


A useful inequality that can be proved using sums of squares is Schur’s inequality. Unlike the previous inequali¬ 
ties, which generalise to arbitrarily many variables, this has just three terms. 

9. Suppose a>b>c and x + z > y > 0. Show that x 2 (a - b) (a - c) + y 2 (b - c)(b - a) + z 2 (c - a) (c - b) > 0. 

[Strong 6-variable Schur] 

It is often quoted as the much weaker result shown below. 

10. Show also that x(a - b) (a - c) + y(b -c)(b - a) + z(c -a)(c — b) > 0. [Weak 6-variable Schur] 

This can be used, with a little work, to form a very powerful inequality. 

11. Let / : IR -» R + be a function expressible as the sum of non-negative monotonic functions. Let g : !R -> IR 
and h : [R ->IR be odd and increasing. Show that 

f(a) g(h(a - b ) h(a - c)) + f(b) g(h(b - c) h(b - a)) + /(c) g(h(c - a) h(c - b)) > 0. [Weak generalised 
Schur] 

When h(w) = w k and g(w) = w, this is known as the Vornicu-Schur inequality. With the additional constraints of 
k = 1 and /(w) = w p , this is simply Schur’s inequality. 

■ If a, b, c e [R + , then a p (a — b)(a — c) + b p {b — c)(b — a) + c p (c - a) (c - b) > 0. [Schur’s inequality] 

It is popularly believed that a suitable combination of Muirhead and Schur can conquer any inequality. This is 
obviously an exaggeration, since neither can prove (for instance) Jensen’s inequality. Nevertheless, most symmet¬ 
ric inequalities in three variables submit to such an attack. 

12. Prove that x 6 + y 6 + z 6 + 3 x 2 y 2 z 2 > 2 x 3 y 3 + 2 y 3 z 3 + 2 z 3 x 3 . 

Nevertheless, we can go further. The strong 6-variable Schur inequality can also be generalised in a similar way to 
its weaker counterpart. We define a function / to be positive-illuminable if f(ax) < a f(x) for all 0 < a < 1 and 
x > 0. Informally, this means that a light source placed infinitesimally above the origin will be able to illuminate 
every point on the curve y = /(x), x > 0 from above. This is demonstrated in the following diagram, where no rays 
emitted from the origin intersect the curve twice. Positive-illuminability is a weaker condition than convexity. 
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We are now in a position to state and prove the stronger generalised form of Schur’s inequality. 

13. Let / : IR -» R + be a function expressible as the sum of non-negative monotonic functions. Let g : !R -> IR 
and h : [R -> IR be odd, increasing and positive-illuminable. Show that 

/(a ) 2 g(h(a - b ) h(a - c)) + f(b) 2 g(h(b - c ) h(b - a)) + /(c ) 2 g(h(c - a) h(c - b )) > 0. [Strong generalised 
Schur] 


Calculus 


Although ideas of limiting processes and integration can be traced back to Archimedes, our modern understand¬ 
ing of calculus was developed much later. It was conceived independently, and almost simultaneously, by Sir 
Isaac Newton and Gottfried Leibniz. As Newton only considered differentiation with respect to time, we currently 
use Leibniz’s (much clearer) notation instead. 

In the explorations of various general inequalities, terms such as ‘increasing’, ‘convex’ and ‘positive-illuminable’ 
appeared. It is possible to express each of these concepts in the environment of calculus. We will represent the 
first derivative of a function /(x) with /' (x) . The second derivative, /" (x), is also of interest. 

■ A differentiable function / is increasing on an interval / if and only if /' (x) > 0 for all x e I. 

This is intuitive. The derivative measures the rate of increase of a function, which we require to be non-negative. 
Convex functions have an increasing gradient, so we require the second derivative to be positive. 

■ A differentiable function / is convex on an interval 1 if and only if /" (x) > 0 for all x e I. 

The properties ‘decreasing’ and ‘concave’ are similarly defined, but with the ‘>’ operator reversed in direction. 

14. Prove that e 2x + e 2y >2 e x+y for all x, y e IR. 

So far, we have considered calculus in one variable. Nevertheless, it is possible to delve into the realms of 
multivariate calculus. The main approach is to consider the partial derivative of a function with respect to a 
variable. To do this, we allow one variable to vary and force the others to remain constant. For example, 
z= y 2 + 2 xy has the partial derivatives — = 2 y and — = 2 y + 2 x. 

ox oy 

If we want to show that the value of a function z = f(x, y) increases as we move parallel to the x-axis, we need to 

dz 

show that — is always non-negative. To investigate how it changes as we move parallel to the vector (3, 2), we 
are interested in 3 — +2 —. 

ox oy 

15. Let x, y and z be positive real numbers. Prove that 4 (x + y + z) 3 > 27 (x 2 y + y 2 z + z 2 x). [BM02 2010, 
Question 4] 

Warning: A stationary point is a point where all partial derivatives are zero. Be careful, however, as this could be 
a point of inflection or saddle point instead of a minimum or maximum. Also, calculus does not guarantee that a 
particular extremum is global; for example, x 3 - 3 x has a local minimum at x = 1, but still takes on arbitrarily low 
values. You should bear this in mind when attempting to tackle a problem using calculus, especially Lagrange 
multipliers. If you want to use calculus to locate an extremum of a function, it is invariably a good idea to sketch a 
graph of the function first. Unfortunately, your two-dimensional paper and three-dimensional imagination are 
insufficient when there are many variables. 


Lagrange multipliers 


Suppose we have some additional constraints on the variables in an inequality. For example, we encountered a 
problem where we had to minimise x 2 + y 2 + z 2 subject to the constraint that x 3 + y 3 + z 3 -3xyz=l. One way of 
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incorporating the side constraint is to homogenise the inequality. In that example, it would involve making all 
terms in x 2 + y 2 + z 2 of degree zero. In this case, it ‘reduces’ to the following problem, which is really quite 
horrible: 


• Find the minimum value of 




(x 3 +y 3 +z 3 -3xyz] 


-, where x, y, ze(R. 


If we could guess that the minimum value is 1 (which is by no means obvious), then it is equivalent to proving 
that (jc 2 + y 2 + z 2 ) 3 > (x 2 + y 3 + z 3 - 3 jc y z)“. One could attempt to bash this degree -6 polynomial inequality with 
any combination of Muirhead, Schur and the u v w method (as we shall do shortly), but it lacks a certain elegance. 


A method that is more amenable to incorporating side constraints into problems is the use of Lagrange multipli¬ 
ers, which enable the application of calculus. If we want to minimise the value of / (which is a function of some 
variables) subject to the algebraic constraint g = 0 (where g is a function of those variables), then we introduce a 
new variable, A. We consider the function A = / + A g, and minimise it by locating its stationary points. We’ll 
start with a simple non-trivial example in two variables: 

• Find the minimum and maximum of f = x 2 + y 2 - x y, subject to the constraint x 2 + y 2 = I. 



The contours of / are ellipses of the form f = x 2 + y 2 - x y = k, and we want to find the ones that touch the circle 
g = x 2 + y 2 - 1=0. Let A = / + A g. Consider a point of tangency, such as that highlighted in the diagram above. 
We imagine setting a new orthogonal coordinate system centred at this point, with an axis normal to the common 

tangent. Call this coordinate a>. The partial derivatives — and ^ are both non-zero, whereas the partial deriva- 

v 

tives with respect to the other axes are all zero. Hence, if we let A = , the partial derivatives of A with respect 

did 

to all of the (new) axes are zero, so the partial derivatives are all zero. In other words, any extremal point of / on 
the curve g = 0 is also a stationary point of A. This method only works if — is non-zero at the extremal points, so 

it is important to verify this before proceeding with the method of Lagrange multipliers. In this example, g is 
quadratic and only stationary at the origin, so we can safely apply the method. 

• Find the stationary points of A = x 2 + y 2 - x y + A(jc 2 + y 2 - l). 


Equating — = 0 and — = 0, we have the equations 2x-y + 2Xx = 0 and 2y-x+2Xy = 0, which simplify to 
2(A+l) = -= -. Hence, y 2 = x 2 and thus x = ±y, from whence we obtain all four tangency points: 

—zz, ± ). They correspond to the maximum value / = 3 and minimum value / = 1. 
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We shall now contemplate the original problem. As shown above, let f = x 2 + y 2 + z 2 and 
g = x 3 + y 3 + z 3 - 3 x y z - 1. This simplification is more appetising than the previous attempt at homogenising 
the problem. 

• Find the stationary points of A = x 2 + y 2 + z 2 + A(x 3 + y 3 + z 3 - 3 x y z - 1). 

Differentiating it with respect to x gives the partial derivative ^ = 2 x + 3 A x 2 - 3 A y z, which we wish to equate 
to zero. Similarly, by differentiating with respect to y and z, we obtain two more equations. (The final equation, 
77 = 0, is precisely the original side constraint, x 3 + y 3 + z 3 - 3 x y z = 1.) 

oA 

More interestingly, we can multiply 2x + 3Ax 2 -3Ayz = 0 by x to result in the cubic equation 
3Ax 3 + 2x 2 = 3Axyz. Hence, x, y and z are all solutions of the equation 3 A x 3 +2 x 2 = k, where k = 3 A x y z. 
Either x, y, z are the three distinct roots, or two of them are equal. In the former case, we have x y + y z + z x = 0 

3 

by Vieta’s formulas, resulting in the equation x 3 + y 3 + z 3 - 3 x y z = (x + y + z) 3 = (x 2 + y 2 + z 2 ) 2 , and thus 
x 2 + y 2 + z 2 = 1 and we are done. In the other case, we can assume without loss of generality that y = z and thus 
eliminate a variable. 

• Find the stationary points of A = x 2 + 2 y 2 + A(x 3 + 2y 3 -3 xy 2 - l). 

We obtain ^ = 4y + 6Ay 2 -6Axy = 0, which has solutions y = 0 and A(x - y) = |. The former case clearly 
results in (x, y, z) = (1, 0, 0), again giving a minimum of x 2 + y 2 + z 2 = 1. The other solution is more intricate. By 
considering the other partial derivative, — = 2x + 3Ax“-3Ay=2x + 3 A(x + y) (x - y) = 0, we get 
2 x + 2 (x + y) = 0. This gives 2 x = —y, which can be substituted back into the original equation to give 
-27 x 3 = 1, or (x, y, z) = , |, This also attains the value of x 2 + y 2 + z 2 = 1. We now just need to choose 

the minimum value of x 2 + y 2 + z 2 , which is 1. 

16. Find the distance from the closest points on the hyperbola x y + x 2 = 1 to the origin O = (0, 0). 
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The uvw method 


Symmetric polynomial inequalities in three positive real variables have frequently appeared in olympiads. The 
‘uvw method’ uses the idea of expressing these as polynomials in the ESPs. 

• 3 u = x + y + z 

• 3 v 2 = xy+yz + zx 

• w 3 = xy z 

17. If x, y, z e 1R, prove that ('x 2 + y 2 + z 2 f > [x 3 + y 3 + z 3 - 3 x y z)~■ When does equality occur? 

The full power of the u v w method is realised when we require that x, y, z. > 0. By the AM-GM inequality, 
u > v > w with equality if and only if x = y = z. This leads to an approach for tackling all three-variable symmetric 
polynomial inequalities of reasonably low degree. 



0.0 0.5 1.0 1.5 2.0 

The blue plane x + y + z = 3u intersects the red two-sheeted hyperboloid x y + y z + z x = 3 v 2 in a conic. It is the 
intersection of the blue plane with the sphere with equation x 2 + y 1 + z 2 = 9 u 2 - 6 v 2 , so is a circle. If we fix u and 
v, we can ‘move’ around the circumference of the circle and examine how w varies. This can be accomplished by 
the method of Lagrange multipliers. 
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18. Show that the stationary points of A = x y z + X[x 2 + y 2 + z 2 + 6 v 2 - 9 n 2 J + fi(x + y + z - 3 u) occur only 
where two of the variables are equal. 

If the circle intersects the planes x = 0, y = 0 and z = 0, however, we must also account for the ‘boundary case’ 
where one of the variables is zero. 

■ If we want to find the maximum or minimum values of w 3 for some fixed u and v 2 , it suffices to only check the cases 
where x = 0 or y = z. 

Now let’s suppose we are trying to prove a symmetric polynomial inequality where the degree of the greatest term 
is 8 . It can be expressed as the inequality F w 6 + 2 G w 3 + H > 0, where F. G, H are functions of it and v 2 . This is 
a quadratic in w 3 , so its extreme values occur when either w 3 is minimised, maximised, or reaches the stationary 
point. By differentiating the above expression with respect to w 3 , this occurs when F w + G = 0, i.e. w = -j. 

■ To prove the inequality F vi ' 6 + 2G w 3 + H > 0 (which is an arbitrary symmetric polynomial of degree d < 8 in three 
variables), where F, G, H are polynomials in u and v 2 , it suffices only to check that it holds under each of the 
following three cases: 

■ One variable is zero (without loss of generality, x = 0); 

■ Two variables are equal (without loss of generality, y = z); 

■ F w + G = 0. (only relevant where d > 6 ). [Generalised Tejs’ corollary] 

F tv + G = 0 is a degree-(d - 3) symmetric polynomial equation, where d is the degree of the inequality. In some 
problems, you may be sufficiently fortunate to find that equality can never occur, for instance if F w + G > 0 in all 
cases. Since this is a degree-5 inequality, it can be itself verified using Tejs’ corollary. 


Gamma function 


The function f(x) = 2 X can be defined on the positive integers by the product 2x2x... x2 . If we want to extend 

n times 

this function to the reals and complex numbers, we can do so by using the recurrence f(x + 1) = 2 fix). This gives 
an uncountably infinite number of possible contenders. If we insist that the function is continuous, differentiable 
and is ‘logarithmically convex’, then there is only one possible function: f(x) = exp(.xlog( 2 )), where 

X 2 X 3 

exp(x) = e‘ = l+ x+ — + ^7 + ••• and log(x) is its inverse. 

Euler did the same for the factorial function. The Gamma function is defined by T(x) = (x - 1)! for x e IN and 
more generally over the positive complex numbers with positive real part by the convergent integral 

CO 

T(x) = t *” 1 dt. For complex numbers with negative real part, we can extrapolate using the recurrence 

o 

T(x + 1) = xT(x). For example, it is known that T(^) = x[~n , so T(-^) = -2 xf~n and T(-|) = -j x[~n . 
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A plot of | r(z) | is shown above for complex values of z. Observe that for non-positive integers, the function is 
undefined. 

From the integral definition of the Gamma function, it is straightforward to establish this identity: 

■ — = — • jr At f ! dt. [Identity involving the Gamma function] 

x o 

If we want to show that -^ + -^ + -^ > 0, we can convert it to the equivalent inequality 

oo 

— J(a e~ At + b e~ Bl + c e ~ cr ) t JC ~ 1 d t > 0. If x is positive and a e~ Al + b e~ B ' + c e^ Ct > 0, the integrand and 
o 

integral are therefore also non-negative. 

" " a, a 

19. Prove that Y Y — LJ — £ 0, where c, p u p 2 , ..., p„> 0 and ai, cit, ..., a n e [R. [KoMaL, Problem A493, 

i=t j=\ 

November 2009] 

An interesting fact concerning the Gamma function is that the 

is given by 711 ' . One can verify easily that this agrees with 

r (2 + 1 ) 

sphere. 

20. The E& lattice consists of points in (R 8 such that the coordinates are either all integers or all half-integers, 
and the sum of the coordinates is an even integer. Suppose we place (hyper)spheres of radius r, centred at 
each point in £g. What is the maximum value of r such that the spheres are disjoint, and what is the density 
of the resulting sphere packing? 


volume of a n-dimensional hypersphere of radius r 
known formulae for the line segment, circle and 
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Solutions 

1. This follows from the non-negativity of (x - y ) 2 + (y - z ) 2 + (z - x) 2 . 

2. By induction on the number of variables, the barycentre B = (cty x\ + ... + a n x„, ct\ f(x i) + ... + a n f(x„)) 
must lie in the convex hull of the points [Pj = (x,, fix,))]. As every point on the perimeter of the convex hull 
lies above the curve by the definition of convexity, so too must every point in the interior of the convex hull, 
including the barycentre. 

3. This is the special case of Jensen’s inequality where f(x) = e x . 

4. We can replace a and b with - and 1, respectively, without altering anything, and thus assume without loss 
of generality that b = 1. Applying Jensen’s inequality to f(x) = x a gives the desired result. 

5. (3, 0, 0) majorises (2, 1, 0), so this follows from Muirhead’s inequality. 

6. Without loss of generality, re-define the interval so that <?i = 0, = 1. We then need to prove that 

/(x 2) + f(x\) > f(pi x\ + fa X2) + f(p 1 X2 + P2 x i )• By the definition of convexity, we have 

f(Pi x l + P2 X2) £ Pi f(x 1 ) + p 2 /(x 2 ) and /(/?i x 2 + P 2 x 1 ) < p x f(x 2 ) + p 2 f(x 1 ). Adding these together 
yields the desired inequality. 

7. Suppose we apply a c/-move to a, and a i+l . Consider each sum of the form 

f(a 1 x^ i) + ... + a t x^i) + a i+ i x^+i) + ... + a„ x rr ( n) ) +. Note that this is equal to 

f{a\Xcr(\)+ ... + Qli Xcr(i+ 1 ) + Ctj+I Xcr(i) + Xcr(n)) 

g(aj x^i) + a i+ 1 Jtcrfj+i)) + g(aj + cr , +1 x^) for some convex function g(w) = f(w + k). The previous 
theorem tells us that this cannot increase when a, and a i+ \ are replaced with /?, and p i+ \. Apply this 
principle to all ~ pairs of terms. 

8 . This is a corollary of the energy minimisation lemma and the previous question. 

9. Let a - b = d and b - c = e. Then the inequality becomes x 2 d(d + e) - y 2 d e + z 2 e(d + e) > 0. Rearranging, 
we obtain the equivalent (xd - ze ) 2 + ((x + z ) 2 - y 2 ) d e > 0. This is clearly true if (x + z ) 2 a y 2 . 

10 . x 2 + z 2 > y 2 is a weaker condition than x + z > y, so the result follows from the previous question. 

11. Assume without loss of generality that a>b>c, and let d-a-b and e = b - c. For any non-negative 
monotonic function /, we have f(d) + f(c) > f(b); hence, this must be true of any sum of non-negative 
mono tonic functions. The problem reduces to showing that 

f(a) g(h(d) h(d + e)) + f(c) g(h(e) h(d + e)) > f(b) g(h(e) h(d)). As g and h are increasing, we have 
f(a) g(h(d) h(d + e )) + /(c) g(h(e) h(d + e)) > (/(a) + /(c)) g(h(e) h(d)), which in turn must be greater than 
f(b) g(h(e) h{d )), as f(a ) + /(c) > f(b). 

12. Obviously, the worst-case scenario is when all variables are positive. Expanding the Schur inequality 
x 2 (x 2 - y 2 ) (y 2 - z 2 ) + y 2 (y 2 - z 2 ) (y 2 - x 2 ) + z 2 (z 2 - x 2 ) (z 2 - y 2 ) > 0 gives the variant 

x 6 + y 6 + z 6 + 3 x 2 y 2 z 2 £ x 4 y 2 + x 2 y 4 + y 4 z 2 + y 2 z 4 + z 4 x 2 + z 2 x 4 . The other inequality, 

x 4 y 2 + x 2 y 4 + y 4 z 2 + y 2 z 4 + z 4 x 2 + z 2 x 4 > x 3 y 3 + y 3 z 3 + z 3 x 3 , is a simple application of Muirhead. 

13. Again, assume without loss of generality that a>b>c, and let d = a - b and e = b - c. As h is positive- 
illuminable, h(m + n) > h(m) + h{ri) for all m, n g R + . As g and h are increasing and odd, we have 
/(a ) 2 g(h(d) h(d + e )) + /(c ) 2 g(h(e) h(d + e )) > /(a ) 2 g(h(d ) 2 + h(d) h(e)) + f(c ) 2 g(h(e ) 2 + h(d) h(e)). 

Hence, we can reduce this, effectively, to the case where h is the identity function. As g is positive- 
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illuminable, we can express g(w) = G(w) 2 w for all w e IR + , where G is an increasing function. Now, we let 
x = f(a) G(h(a - b) h(a - c)) and define y and z similarly. We have x + z > y by the same argument as in the 
proof of the weak generalised Schur inequality. The result then follows from the strong 6-variable Schur. 


14. We let f(x) = e 2x . Differentiating this twice gives 4 e 2x , which is positive-definite. Hence, / is convex and 
we can apply Jensen’s inequality to show that (/(x) + f(y)) > 


15. Let w = f(x, y, z) = 4 (x + y + z) 3 - 27 (x 2 y + y 2 z + z 2 x). Differentiate with respect to x to give the partial 
derivative ~ = 12 (x + y + z) 2 - 27 (2 x y + z 2 ). The cyclic sum is 

^ + IT + IT = 36 (x + y + z) 2 - 27 (x 2 + y 2 + z 2 + 2 x y + 2 y z + 2 z x) = 9 (x + y + z) 2 . This is obviously 

ox ay oz v ’ 

positive, so the function increases as we move parallel to the vector (1, 1, 1). Hence, we have 

/(x + h, y + h, z + h)> /(x, y, z) for all h > 0. Assume without loss of generality that x < y and x < z. Using 

the previous statement, /(x, y, z) > f( 0, y - x, y - z). To prove the strict inequality in general, therefore, we 

need only prove the weak inequality when one of the variables is zero. We have reduced the problem to 

showing that 4 (y + z) 3 a 27 y 2 z. This is evident from the factorisation 

4 (y + z) 3 - 27 y 2 z = (4 y + z) (y - 2 z) 2 . 


16. We wish to minimise x 2 + y 2 subject to the constraint x y + x 2 = 1. We use Lagrange multipliers to obtain 
A = x 2 + y 2 - A(x y + x 2 - l). We equate each of its partial derivatives, 2x-Ay-2Ax and 2 y - A x, to 

zero. The latter gives us the value of A, namely —, so we can substitute it into the other equation and obtain 
2x-2- — 4 y = 0. We can multiply throughout by ^ x to give the quadratic x 2 - y 2 - 2 x y = 0, or 
j - 21-j - 1 = 0. The Babylonian formula gives us - = 1 ± XpT. It is sensible to draw a graph of the 
hyperbola to confirm that the root we are looking for is actually - = 1 + . Hence, x = 11 + V2~j y and 

y = [yfl - 1 j x. Substituting this into the equation of the hyperbola gives x 2 = ——. Similarly, we have 

3,2 = [x[2 - l) 2 x 2 = (3 - 2 V2") Jc 2 ,, 

of that, namely 


4-2 V2 .— 

, giving x 2 + y 2 =-= 2 V 2 - 2. The distance is the square-root 

V2~ 


■ V 2 y/T ■ 


■2 . 


17. The inequality (x 2 + y 2 + z 2 ) 3 > (x 3 + y 3 + z 3 - 3 x y z)~ can be expressed in the uvw notation as 

(9 u 2 - 6 v 2 ) 3 > ((3 u) (9 n 2 - 9 v 2 ))“. We can divide throughout by 3 6 , giving the equivalent inequality 

(m 2 - v 2 ) 3 > (m 3 - v 2 u)~, which expands to u b - 2 u 4 v 2 +;» 2 v 4 -^v 6 >« 6 - 2 u 4 v 2 + u 2 v 4 . Observe 

that the first two terms on each side of the equation cancel, so we wish to prove that u 2 v 4 - ^ v 6 > u 2 v 4 , 

or ^ u 2 v 4 > v 6 . We can neatly divide by ^ v 4 (which must be positive, since 9 v 4 = (x y + y z + Z x) 2 ), 

giving 9 u 2 > 8 v 2 . As u 2 is necessarily positive and greater in magnitude than v 2 , this is true. Equality occurs 
when v 4 = 0, i.e. xy+yz + zx = 0. 


18. Firstly, obtain the partial derivative — = yz + 2Ax + /r = 0. We can multiply throughout by x to get 

2 A x 2 + /r x - w = 0, where w = x y z. As x, y, z are all roots of this quadratic equation, which only has two 
roots, at least two must be identical. 


J II It 

19. We note that this is equivalent to — fr c_1 Y Y a,- a ,■ e~ <Pi+p ^' dt>0. Observe that this simplifies to 

r(e) 0 <=1 J = 1 

1 °r ( n 

— Jr|j ajaj £- pi 'j dt > 0, which is necessarily true. 

20. The points in the lattice (regarded as vectors) clearly form a group under addition, so we need only calculate 
the minimum distance from the zero vector to another vector a. If the coordinates of a are all half-integers. 
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the minimum norm (squared length) of a is 8 ) = 2. Similarly, the closest integer point in the lattice is 

(1, 1, 0, 0, 0, 0, 0, 0), with a norm of 2. Hence, the maximum value of r is ^ y[2 , so the volume of each 

7I 4 r 8 id 

sphere is —= —. We now need to determine the number of lattice points per unit volume. The points in 

Z 8 and (Z + ^ ) S each have one point per unit volume, so Z 8 (J (Z + ^ ) 8 has two points per unit volume. Eg 
comprises half of these points (those with an even sum of coordinates), so it has one point per unit volume. 

4 

Hence, the sphere packing has a density of . 
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Projective geometry 

Projective geometry is an extension of Euclidean geometry, endowed with many nice properties incurred by 
affixing an extra ‘line at infinity’. Certain theorems (such as Desargues’ and Pascal’s theorems) have projective 
geometry as their more natural setting, and the wealth of projective transformations can simplify problems in 
ordinary Euclidean geometry. 


The real projective plane 


In Euclidean geometry, we assign a coordinate pair ( x , y) to each point in the plane. In projective geometry, we 
augment this with an extra coordinate, so three values are used to represent a point: ( x, y, z). Moreover, scalar 
multiples are considered equivalent; ( x , y, z) and (A x, A y, A z) represent the same point. R = (0, 0, 0) is not part 
of the projective plane, but can be regarded as a ‘projector’, from which all points, lines, circles et cetera emanate. 

Since scalar multiples of points are considered equivalent, we can identify points in IRP 2 (the real projective 
plane) with lines through the origin. Projective lines are identified with planes through the origin, and are of the 
form ax + by + cz = 0. Note that this equation is homogeneous: all terms are of first degree. In general, all 
algebraic curves are represented by homogeneous polynomials in x, y and z. 



In the diagram above, a triangle ABC is shown in the reference plane (z = 1). The point A is ‘projected’ from R 
along the orange line, intersecting the reference plane at A. Similarly, the ‘plane’ containing the yellow triangle 
represents the line B C. Horizontal Tines’ such as the blue one do not intersect the reference plane, so correspond 
to points ‘at infinity’. The horizontal ‘plane’ (parallel to the reference plane) through R represents the line at 
infinity (z = 0). Parallel lines on the projective plane can be considered to meet at a point on the line at infinity. 

1. Prove that any two distinct lines intersect in precisely one point. 


2 . 


Show that the equation of a line through points (xi , y i, Zi) and (x 2 , yz, z.z) is given by det 


' x y z ' 
xi yi z i 
,x 2 yz Z2 > 


= 0 . 


You have shown that any two points share a common line, and any two lines share a common point. This suggests 
a fundamental interchangeability between lines and points, known as projective duality. We shall explore this 
later. 


3. Let ABC and A' B' C' be two triangles. Let A B meet A ’ B' at C", and define A" and B" similarly. Show 
that A A', B B' and C C’ are concurrent if and only if A", B" and C" are collinear. [Desargues’ theorem] 
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If two triangles exhibit this relationship, they are said to be in perspective. The point P is the perspector, and the 
line A" B" C" is the perspectrix. As Desargues’ theorem is the projective dual of its converse, you only need to 
prove the statement in one direction. Surprisingly, it is actually easier to prove Desargues in three dimensions 
(where the triangles are in different planes); the two-dimensional result then follows by projecting it onto the 
plane. 

4. Let ABC be a triangle and A be a point inside the triangle. The lines A X, BX and C X meet the circle 
ABC again at P, Q and R, respectively. Choose a point U on X P which is between X and P. Suppose that 
the lines through U which are parallel to A B and C A meet X Q and X R at points V and W respectively. 
Prove that the points R, W, V and Q lie on a circle. [BM02 2011, Question 1] 


Cross-ratio and harmonic ranges 


You may have encountered homogeneous projective coordinates before in the form of areal (or barycentric) 
coordinates. Even though qualitative properties such as collinearity and concurrency can be defined in terms of 
any projective coordinates, to compare distances they must first be normalised, i.e. projected onto the reference 
plane. 

Firstly, we define a vector, n, perpendicular to the reference plane. (For Cartesian coordinates, we generally allow 
z = l to be the reference plane, so n = (0, 0, 1). For areal coordinates, x + y + z = 1 is the reference plane, so 
n = (1, 1, 1). In general, if the reference plane is given by a x + b y + c z = 1, the vector n = (a, b, c). For a given 
vector x in the projective plane, it is normalised by the operation x -> — 

5. If A, B, C and D are collinear points represented by normalised vectors a, b, c and d_, respectively, show that 

AB _ a-b 
C~D £_ - 



Suppose we have three points, A, B and C, which are collinear. We can select an arbitrary point P, and a further 
arbitrary point Q lying on the line P B. Let Q C meet P A at R, and Q A meet P C at S. Finally, let R S meet line 
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ABC&tD. 

6 . Prove that AE ’ CD = -1, irrespectively of the locations of P and Q. (Hint: this can be done by applying two 

BC-DA 

similar theorems in quick succession.) 

It is a remarkable fact that the location of D does not depend on that of P and Q. B and D are described as 
projective harmonic conjugates with respect to the line segment A C. For any four collinear points, the quantity 

(A, C; B, D) = AB CD is known as the cross-ratio. If (A, C, B, D) = - 1, we say that they form a harmonic range. 

BC-DA 

■ (A, C; B, D) = -1 is equivalent to A and C being inverse points with respect to the circle on diameter B D , which is in 
turn equivalent to the circles on diameters A C and B D being orthogonal. [Equivalent definitions of harmonic 
range] 

There is another remarkable and useful fact concerning harmonic ranges. Let P be a point not on the line ABCD. 
Then any two of the following four properties implies the other two: 

• LAP C = -\ 

2 

• P A is an angle bisector of LB PD\ 

• PC is the other angle bisector of L B P Z); 

• (A, C;B, D) = - 1. 



By Thales’ theorem, the first of these conditions is equivalent to P lying on the circle with diameter A C. Hence, if 
P lies on either of the intersection points of the (orthogonal) circles on diameters A C and B D, the four lines 
through P divide the plane into eight equal octants of angle - 


Projective transformations 


Returning to the idea of representing points in the projective plane as three-element vectors, we can consider the 
group of operations represented by linear maps x^Mx, where M is a non-singular matrix. These are known as 
projective transformations, or collineations. 

7. Show that applying a projective transformation to a line results in another line, and thus that collinear points 
remain collinear. 

A generalisation is that degree-d algebraic curves are mapped to degree-c/ algebraic curves by projective 
transformations. Hence, conics are preserved. Moreover, it is possible to choose a projective transformation 
carefully to map a given conic to any other conic, with three real degrees of freedom remaining. 

8 . Show that applying a projective transformation with matrix M to a point x = p + Aq results in a point with 

M p +A M q 

normalised coordinates-=-—. 

n ■ M p+n ■AM q 
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9. Hence show that the cross-ratio (A, C; B, D ) of four collinear points is preserved under projective 
transformations. 

10. For any four points A, B, C and D, no three of which are collinear, show that there exists a unique projective 
transformation mapping them to (1, 0, 0), (0, 1, 0), (0, 0, 1) and (1, 1, 1), respectively. 

11. Hence show that there exists a unique projective transformation mapping any four points (no three of which 
are collinear) to any other four points (no three of which are collinear). 

The last of these theorems enables one to simplify a projective problem by converting any quadrilateral into a 

parallelogram. This enables one to find all of the harmonic ranges in the complete quadrangle displayed below. 

Try to spot as many as you can! 



z 


X 


12. The diagonals of the quadrilateral AB CD meet at X. The circumcircles of AB X and CDX intersect again 
at Y ; the circumcircles of B C X and D AX intersect again at Z. The midpoints of the diagonals A C and B D 
are denoted M and N, respectively. Prove that M, N, X, Y and Z are concyclic. [Sherry Gong, Trinity 
2012] 

Another configuration occurring in many instances is a quadrilateral with an inscribed conic. With a projective 
transformation, we can convert the quadrilateral into a parallelogram. To simplify matters even further, a 
(possibly imaginary) affine transformation is capable of turning the conic into a circle. The symmetry of the 
configuration implies that the diagonals of the quadrilateral are concurrent with the lines joining opposite points 
of tangency. 



When viewed as a complete quadrilateral (four lines in general position intersecting in six points), we have three 
of these concurrency points and plenty of harmonic ranges! The centre of the inscribed conic also happens to lie 
on a line passing through the three midpoints of pairs opposite vertices, by Newton's theorem. This line is thus the 
locus of the centres of all possible inscribed conics. 
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Polar reciprocation and conics 

One convenient way to demonstrate projective duality is by creating a bijection between lines and points, such 
that collinear points map to concurrent lines. There are several ways in which this can be defined, but one of the 
most elegant is polar reciprocation: 



13. Consider a point P in the unit circle. Draw a line A through P, intersecting the unit circle at A and B. Let Q 
be the projective harmonic conjugate of P with respect to A B. Show that the locus of Q as A varies is a 
straight line. 

This locus is known as the polar of P, and P is its pole. Moreover, there is nothing special about the unit circle in 
projective geometry, so this construction generalises to any conic. 

14. Let ABC be a scalene triangle, and let T be its nine-point circle. T intersects B C at points P and Q\ the 
tangents from T at P and Q intersect at A'. Points B' and C' are defined similarly. Prove that the lines A A', 

B B’ and C C’ are concurrent. [NST4 2011, Question 3] 

The easiest route to solving the above problem is to prove the much more general theorem that a triangle and its 
polar reciprocal triangle (with respect to any conic) are in perspective. This is (like many results in projective 
geometry!) known as Chasles’ theorem. 

Polar reciprocation generalises to three dimensions, where we can reciprocate about a sphere (or, more generally, 
quadric surface). 

15. What is the polar reciprocal of a dodecahedron with respect to its circumsphere 5? 
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More generally, the dual of the regular polyhedron with Schlafli symbol {a, b) has the Schlafli symbol {b, a}. For 
higher dimensions, we simply reflect the symbol. The tetrahedron, with the palindromic Schlafli symbol {3, 3), is 
thus self-dual, as is the square tiling with Schlafli symbol {4, 4). More generally, a simplex has Schlafli symbol 
{3, 3, 3, ..., 3, 3) and a hypercubic tessellation has Schlafli symbol {4, 3, 3, ..., 3, 4j, both of which are palin¬ 
dromic. For four-dimensional solids, the 4-simplex {3, 3, 3} is not the only self-dual regular polychoron; we also 
have the ‘24-celT with Schlafli symbol {3, 4, 3) (meaning that three octahedral cells are clustered around each 
edge). 


Circular points at infinity 


In projective Cartesian coordinates, the equation of a circle is of the form x 2 + y 2 + bxz + cyz + dz 2 = 0. Note 
that the points (1, i, 0) and (i, 1, 0) satisfy this equation, where i is the imaginary unit. Hence, all circles can be 
considered to pass through two imaginary ‘circular points’ on the line at infinity. Indeed, a circle can be defined 
as any conic passing through both circular points. In the «-dimensional complex projective plane (CP"), we have a 
(n - 2)-sphere on the line at infinity contained by all (n - l)-spheres. 

Apart from the circular points, other imaginary points on the complex projective plane (CP 2 ) are scarcely useful, 
so there is no need to worry about them. 

16. Let ABC DEF be a hexagon, the vertices of which lie on a conic. Let A B and D E meet at X ; B C and E F 
meet at Y ; and C D and F A meet at Z. Prove that X, Y and Z are collinear. [Pascal’s theorem] 



17. Write down the projective dual of Pascal’s Theorem. [Brianchon’s theorem] 
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18. Let A, C and E lie on a straight line, and B, D and F lie on another straight line. Let A B and D E meet at X ; 

B C and E F meet at Y ; and C D and F A meet at Z. Prove that X, Y and Z are collinear. [Pappus’ 
theorem] 

19. Let ABC be a triangle, and let the tangent to its circumcircle at A meet BCatD. Let 1 be a line meeting A D 
internally at P, the circumcircle at Q and T, the sides A B and A C internally at R and S respectively, and B C 
at U. Suppose that P QRST U lie in that order on l. Show that if QR = ST then P Q - U T. [UK IMO 

Squad Practice Exam 2011, Question 2] 

20. Suppose Ti and 1L are two disjoint ellipses, with T\ inside r 2 . If there is at least one triangle with its sides 
tangent to T 1 and vertices on r 2 , show that there are infinitely many. [Poncelet’s porism] 

Actually, this theorem generalises to polygons with any number of sides. However, it is very difficult to prove 
with elementary methods. 

■ Suppose T1 and T2 are two disjoint ellipses, with T1 inside U. If there is at least one n-gon with its sides tangent to T1 
and vertices on T2, then there are infinitely many. [Poncelet’s porism] 

You may have noticed that Pappus’ theorem is a special case of Pascal’s theorem (and indeed Brianchon’s 
theorem, as it is self-dual) when the conic degenerates into a pair of straight lines. Both of these theorems can be 
considered to be special cases of the Cayley-Bacharach theorem. 



■ Three cubic curves each pass through the same eight points, no four of which are collinear and no seven of which are 
conconic. The three cubics then share a ninth point. [Degree-3 Cayley-Bacharach theorem] 


21. If two circles intersect, the radical axis is the line passing through both intersection points. For three 
mutually intersecting circles, prove that the three radical axes are concurrent. [Radical axis theorem] 


22. Three circles M N P, N L P and EM P have a common point P. A point A is chosen on circle MNP (other 
than M , N or P). AN meets circle N LP at B and A M meets circle LM P at C. Prove that B C passes 
through L. [UK MOG 2011, Question 1] 
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Cayley-Bacharach generalises to curves of arbitrary degree, but the generalised version is difficult to prove. The 
quartic version is given below. 

■ Three quartic curves pass through the same thirteen points, no five of which are collinear, no nine of which are 

conconic and no twelve of which are concubic. The three quartics then share a further three points. [Degree-4 Cayley- 
Bacharach theorem] 

23. Let the eight vertices of an octagon lie on a conic, and alternately colour the edges red and blue. Prove that 
the remaining eight heterochromatic intersections of (the extensions of) the edges lie on another conic. 

[Generalised Pascal’s theorem] 

24. Two cyclic quadrilaterals, ABC D and A' B' C D\ share the same circumcircle. The four intersections of 
corresponding edges (e.g. A B with A ' B ') are labelled P. Q, R and .S'. Show that if P, Q and R are collinear, 
then S also lies on this line. [Two butterflies theorem] 

25. Let P Q be the chord of a circle T, and let M be the midpoint of P Q. Chords A B and C D are drawn through 
M. Let A C and B D intersect P Q at R and S, respectively. Prove that M is the midpoint of R S. [Butterfly 
theorem] 

26. Let ABC D be a quadrilateral. A C and B D intersect at E. X and Y are two points in the plane, and the line 
X Y intersects AC at F and B D at G. R is the harmonic conjugate of F with respect to A C; S is the 
harmonic conjugate of G with respect to BD. The conics ABEXY and CDEX Y intersect a fourth time at 
P\ the conics BCEXY and DAEXY intersect a fourth time at Q. Prove that P QRS EX Y are conconic. 

[Sam Cappleman-Lynes, 2012] 

27. A number of line segments (l \, / 2 , • • •, /„) are drawn in general position on the plane, such that every pair of 
line segments intersects. A line A cuts all of the line segments. For each the endpoint on the left of A is 
called Aj, and the other endpoint is called B,. An ant walks along a line segment /, in the direction A,- -> B,. 
Whenever it hits £,, it teleports to A,. Whenever it meets an intersection point (/,- f) lj ), it moves onto the 
other line segment lj and continues moving (in the direction Aj -> Bj, still). Prove that there exists an initial 
position of the ant such that it visits every line segment infinitely often. 


Finite projective planes 


The applications of projective geometry in olympiad problems involve infinite projective planes, namely [RP 2 and 
CP 2 . The construction where we take three coordinates ( x , y, z) and consider scalar multiples to be equivalent 
generalises, enabling one to define a projective geometry over any field. For instance, the finite field of order 2 
results in a projective plane with seven points: (0, 0, 1), (0, 1, 0), (0, 1, 1), (1, 0, 0), (1, 0, 1), (1, 1, 0) and 
(1, 1, 1). This finite projective plane, considered to be the simplest non-trivial geometry, is called the Fano plane: 
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The seven lines are shown (in the left diagram) as six straight lines and one circle; it is impossible to embed it in 
the real projective plane using only straight lines. Although not obvious from the left diagram, the right diagram 
demonstrates that all points are equivalent. There are, in fact, no fewer than 168 symmetries, corresponding to the 
rotation group PSL(2, 7) of Klein’s quartic. Fixing a single vertex reduces the number to —^ = 24 symmetries, 

which are apparent in the following embedding in three-space. One point is at the centre of an octahedron formed 
by the other six points. The three orthogonal axes of the octahedron, together with the circumcircles of alternate 
faces, form the seven lines of the Fano plane. 



Coincidentally, this resembles an embedding (the Roman surface) of the real projective plane into (R 3 . 

28. The seven vertices of the Fano plane are each coloured with one of c colours. How many different 
colourings are possible, taking into account the symmetries? 
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Solutions 


i. 


2 . 


3. 


4. 


5. 

6 . 


7. 


8 . 


9. 


10 . 


11 . 


Distinct lines correspond to planes through R. They must intersect in a line through R, which corresponds to 
a point on the projective plane. 


The volume of the tetrahedron RABC, with coordinates (0, 0, 0), (x 1; y\, zi), (X 2 , V 2 , Z 2 ) and (X 3 , V 3 , 23 ), 


respectively, is given by ^ det 


Xi X2 X3 ' 

M yi ,V3 


. If this is zero, the points R, A, B and C must be coplanar. As we 


Z 1 Z2 Zi 

can assume without loss of generality that A, B and C lie on the reference plane, and R does not, we can 
deduce that A, B and C must be collinear. (The converse is obviously also true.) This condition that three 
points are collinear can be converted into the equation of a line, by allowing C to be a variable point. 


Assume P exists, and consider the case where the two triangles lie in different planes. The two planes must 
then intersect on a line, where A", B" and C" must obviously lie. The result for two dimensions is obtained 
by projecting it onto the plane. Note that the converse is the projective dual, so proving it in one direction is 
sufficient. 


Applying Desargues’ theorem results in the revelation that W V is parallel to B C. Hence, W XV is similar 
(indeed, homothetic) to C X B, so the result follows by applying the converse of the intersecting chords 
theorem to point X. 

Consider the reference plane and use the Pythagorean distance formula. 


A B CS PR 

Applying Ceva’s theorem and Menelaus’ theorem result in the two equations: — = 1 and 

BC SP RA 

DA C S PR 

— —■ —- = -1. Dividing the two equations yields the desired result. 

CD Jp RA 


A projective transformation of the projective plane P 2 is a linear transformation of the Euclidean space R 3 
in which it can be considered to reside. As planes are preserved by linear transformations, lines are 
preserved by projective transformations. 


Firstly, we have M x = M(p + Xq) = M p + XM q. However, this must be normalised in the obvious way, 

M p +A M q 

yielding ——=-—-. The dot product is distributive over addition. 

n • I M p +A M q\ 


Consider the four points a = p + a q,b_ = p + [i q, c = p + y q and d = p + 6 q. Calculating the cross-ratio 

— B) (y— S) 

before and after the projective transformation will result in the same value of-. 

F J U3-y)(6-a) 


Instead, we will find the inverse matrix. As we want the three unit vectors to be mapped to a, b and c, we 
write the (unnormalised) coordinates of each of them in each column of the matrix. To map (1, 1, 1) to J, 
we need to multiply each of the three columns by nonzero real constants, which are the solutions to three 
simultaneous linear equations. For obvious reasons, there is a single such solution. 


Firstly, transform the four original points to (1, 0, 0), (0, 1, 0), (0, 0, 1) and (1, 1, 1). Now, we can 
transform these to the four final points. Multiplying the two matrices together results in a single 
transformation. 
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12. Inverting about X results in a projective linear configuration, where we have to show that the intersections 
of opposite sides of quadrilateral ABC D are collinear with the projective harmonic conjugates of P with 
respect to each of the diagonals. We can projectively transform ABC D to the vertices of a square, 
rendering the problem trivial. 

13. Form two points Q\ and Qi in this manner. Apply a projective transformation taking them to the line at 
infinity, so P is the midpoint of two chords cutting an ellipse. Applying an affine transformation makes this 
ellipse into a circle, and P lies on two diameters so must be the centre. Obviously, the polar of P is the line 
at infinity. Applying the inverse projective transformations to this configuration will result in the original 
configuration, and the polar will remain a straight line. 

14. Instead, we will prove Chasles’ theorem (that a triangle and its polar reciprocal are in perspective) here, as 
the original problem is a special case. Due to Desargues’ theorem, we can prove that the corresponding 
sides of the two triangles intersect at collinear points. Without loss of generality, we can assume two of 
these points are at infinity, and our objective is to show that the third also lies at infinity. We can then apply 
a projective transformation to take the two intersections of the conic with the line at infinity (which exist, 
due to Bezout’s theorem) to the circular points at infinity (thus preserving the line at infinity, so this is an 
affine transformation), so we can assume that the conic is a circle. In this case, we have that the polar of A is 
parallel to B C, thus O A is perpendicular to B C. The polar of B is parallel to A C, thus OB is perpendicular 
to A C. Hence, O is the orthocentre of A B C, so O C is perpendicular to A B. This means that the polar of C 
is parallel to A B, so the triangles are in perspective. 

15. The polar reciprocal (or ‘dual’) is a regular icosahedron with insphere S. 

16. By projective transformations, we can assume the conic is a circle and the lines A B and D E are parallel, as 
are the lines A F and D C. From this, we draw the lines l\ and I 2 (through O, the centre of the circle) such 
that B and E are reflections of A and D, respectively, in l\, and that F and C are the reflections of A and D, 
respectively, in h. Hence, BE is a rotated copy of F C, so they are congruent. This means that B C and FE 
must indeed be parallel. 

17. If a hexagon is circumscribed around a conic, its three main diagonals are concurrent. 

18. Consider the degenerate case of Pascal’s theorem where the conic is two lines. 

19. We can reflect about the perpendicular bisector of R S (also the perpendicular bisector of Q T ) to transform 
this into a more projective problem. We will indicated a reflected version of point A with A', et cetera. We 
want to prove that the tangent at A meets the line B' C" somewhere on the line l. To do this, apply Pascal’s 
theorem to the degenerate hexagon AAA'C’B’B.AsAA' and B B ' intersect at the point at infinity on /, and 
A B and A' C' intersect at R (also on /), we have that the tangent at A indeed meets B' C' on /. By definition, 
this must be at point P and point U ', so P is the reflection of U in the perpendicular bisector of R S. The 
problem becomes trivial. 

20. In general, the ellipses must intersect four times on the complex projective plane; we can transform two of 
those points to the circular points at infinity, resulting in two circles. The problem then becomes equivalent 
to showing that infinitely many triangles share the same incircle (or excircle) and circumcircle. This is a 
consequence of Euler’s formula, O I 2 = Rr - 2 Rr, and dimension counting. 

21. We can consider the union of one circle and the radical axis of the other two to be a cubic. Repeating for the 
other two circles yields three cubics which intersect in at least eight points (the two circular points, plus the 
six pairwise circle intersections) so must intersect in the ninth. By applying Bezout’s theorem, this 
additional intersection point cannot lie on any of the circles, so must lie on all three radical axes. 

22. Same argument as in the previous question, but applied to the diagram displayed near the question. 
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23. Let Q\ be the union of red lines, Qi be the union of blue lines, and Qt be the union of the main conic with 
the conic passing through five of the other eight heterochromatic intersections. By the quartic version of 
Cayley-Bacharach, Qa must pass through the other three intersections. 

24. This is essentially the same argument as before, but with the realisation that a conic passing through three 
collinear points implies that it is a degenerate conic (union of two lines). Four of the eight ‘other 
heterochromatic intersections’ must lie on one line, and the other four lie on another line. The result then 
follows. 

25. Reflect the ‘butterfly’ A CRD in the perpendicular bisector of P Q to create another butterfly. A' C" B'D'. 
Colour A B, B' C', C D and D' A ' red, and the remaining four edges blue. Since four of the heterochromatic 
intersections must be collinear (lying on the mirror line), so must the other four. Flence, D' B' and A C 
intersect at R, and the mirror image must be S. (This is a degenerate case of the ‘two butterflies theorem’.) 

26. Apply a projective transformation to send X and Y to the circular points at infinity. The problem is then 
reduced to question 12. 

27. This is the projective dual of Geoff Smith’s ‘windmill problem’ from IMO 2011. An official solution can be 
easily obtained from the Internet. 

28. There is one identity permutation. If we choose to fix one vertex and consider the three-dimensional 
embedding, we have eight rotations which permute the remaining six vertices in two 3-cycles. As there were 
seven vertices to initially choose from, this results in 56 pairs of 3-cycles. We can also rotate by n about any 
of the six ‘diagonal’ axes, resulting in another 42 permutations, each comprising a 2-cycle and 4-cycle. A 
rotation by n about an orthogonal axes gives a pair of 2-cycles and fixes three collinear points; this gives 21 
further permutations. There must be a 7-cycle due to the floral embedding of the Fano plane. When 
translated into the three-dimensional embedding, this becomes totally asymmetric, so we have 24 7-cycles 
in this conjugacy class. Moreover, reversing their direction gives 24 more 7-cycles, completing the list of 
168 symmetries. Applying Burnside’s lemma, we get (c 7 + 56 c 3 + 42 c 3 + 21 c 5 + 48 c) distinct 
colourings. 
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Complex numbers 

We assume that you are familiar with complex numbers in algebra, and delve immediately into the use of complex 
numbers in geometry. We are able to use complex numbers in two-dimensional geometry because the Euclidean 
plane, IR 2 , is isomorphic to the complex plane, C. We represent a point with Cartesian coordinates (x, y ) with the 
complex number z = x + i y, where i is the imaginary unit. Complex numbers are superior to two-dimensional 
vectors in that rotations are easy to define, as we shall see shortly. 


Basic properties of complex numbers 


Complex numbers can be added, subtracted and multiplied by real numbers in precisely the same way that vectors 
can. They have a magnitude and argument, which correspond to the length and direction of a complex number. 
Also, for a complex number z = x + iy, we define its complex conjugate to be z* = x - i y. As i and -i have 
definitions analogous to ‘left’ and ‘right’, we can interchange all instances of i with -i in an algebraic equation 
without affecting anything. Hence, (z w)* = z * w* and (z + w)* = z* + w*. 



In the above diagram, arg(z) = 6 is the argument of z, and | z \ = r is the magnitude (or modulus) of z. An 
important identity is that zz* = r 2 , which enables the (squared) modulus of a complex number to be calculated. 
Using vector subtraction, this gives us A B 2 = ( a - b) (a* - If ) for the squared distance between two points. 

If we multiply a complex number with polar form (r t , 6 \) with another complex number (r 2 , 6b), it is easy to 
verify that we obtain the complex number (u r 2 , Q\ + 6b). Hence, arg(zi z 2 ) = arg(zi) + argfe) and 
| Zl Z2 | = | zi | | Z2 |- 

We can define the inner product in terms of complex numbers as z-w = - (zw* + w z*), which is analogous to the 
dot product of vectors. Similarly, we define the exterior product as zxw = ^ i(wz* - zw*), which resembles the 
cross product. 

1. Prove that the area of A O B is given by [AO B] = ^ (a x b) = ^ i(b a* - ab*). 


2. Hence prove that the area of triangle ABC is given by 

6 1 1 1 \ 


[ABC] = - (bxa + cxb + axe) = i det 


b 

b* 


. [Area of a triangle] 
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3. If we have a triangle and erect equilateral triangles on its sides, prove that the centres of those equilateral 
triangles themselves form an equilateral triangle. [Napoleon’s theorem] 


Angles, circles and concyclicity 


4. Show that the directed angle LABC = arg(a - b) - arg(c - b) = arg(^). 

5. Hence deduce that lABC = lADC (mod n) if and only if {a b){t d) is real. [Real cross ratio <=> 

( b-c ) (d-d) 

concyclicity] 

More specifically, if this value is equal to -1, then ABCD is known as a harmonic quadrilateral. Harmonic 
quadrilaterals are covered in the chapter on projective geometry. 

6. Show that the equation of a circle with centre P and radius r has the equation 
zz* - p z* - p* z + p p* - r 2 = 0. [General form of a circle] 

7. Prove that four points, AB C D, are mutually concyclic (or collinear) if and only if 
(,a - b) (b* - c*) (c - d) (d* - a*) = (a* - b*) (b - c) (c* - d*) (d - a). 


8. Prove that four points, AB C D, are mutually concyclic (or collinear) if and only if 

1111 


det 


a b 
a* b* 
a a* b b* 


c 
c * 
cc* 


d 
d * 
dd* 


= 0 . 


The observant amongst you may notice that the previous two questions are equivalent quartic expressions. This 
demonstrates the equivalence between the ‘angles in the same segment’ and ‘equidistant from a common point’ 
conditions for concyclicity. Any quadratic function, which vanishes only on the circumference of a circle, must 
necessarily be proportional to the power of a point with respect to that circle. This gives us a more general result, 
which I believe has yet to be published elsewhere: 


For any four points, ABCD.no three of which are collinear, we have 


det 


1 

1 

1 

1 

a 

b 

c 

d 

a* 

b* 

c* 

d * 

a a * 

bb* 

cc* 

dd* . 


-4 i [A B C] Power (P, ABC). [Goucher’s theorem] 


Interestingly, it was noted that at almost the same time a problem particularly vulnerable to this theorem was 
proposed at an International Mathematical Olympiad. There are ordinary Euclidean methods of proving this, but 
they are less inspired and do not explain why this result should hold. 


9. Suppose we have a non-cyclic quadrilateral, P\ P 2 P 3 Pa- Let 0\ and R\ be the centre and radius, 

respectively, of the circumcircle of P 2 P 3 Pa , and define O 2 , O 3 , Oa and R 2 , R 3 , Ra similarly. Show that 

= 0. [IMO 2011 shortlist. Question G2] 


O, Pi-Rr 


Oj P2 ~Pl‘ 


0 3 P 3 z -R 3 - 


Oa Pa ~Ra 


Reflections and rotations 


A useful property of complex numbers is the ability to express rotations and reflections rather simply. We have 
concise expressions for reflection about the real axis (complex conjugation), rotation about the origin 
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(multiplication by a unit complex number) and translation (addition of a complex number). Here are the three 
‘elementary’ operations: 

• A translation parallel to the vector O A is represented by z -» z + a. [Translation] 

• An anticlockwise rotation about the origin by the angle 9 is represented by z -» z e lB . [Rotation about the 
origin] 

• A reflection in the real line is represented by z -> z*. [Reflection in the real axis] 

These become more useful when one realises that they can be composed to yield any Euclidean transformation. 

10. Show that an anticlockwise rotation by 9 about the point A is represented by z -> z e' 8 + a(l - e' 8 ). 

11. Hence demonstrate that an arbitrary direct congruence is a transformation of the form z -> az + b, where 
a a* = 1. 


If we relax the condition that a a* = 1, we obtain the result that two directly similar figures can be related by a 
transformation of the form z-* az + b. As this is a linear function, we can linearly interpolate between any two 
directly similar figures to obtain a third directly similar figure. Specifically, if triangles ABC and A’ B' C" are 
directly similar, then the (optionally weighted) midpoints of A A ', B B' and C C" form a third similar triangle. 



This is known as the fundamental theorem of directly similar figures. In the diagram above, the red triangle is the 
‘arithmetic mean’ of the blue and green triangles. If we have five directly similar figures in general position, then 
any directly similar figure can be expressed as a ‘weighted mean’ of those five. 

12. Show that a reflection in the line z e~ l8 e R is represented by z -» z* e 2lB . 

13. Hence demonstrate that an arbitrary indirect congruence is a transformation of the form z -> az* + b, where 
a a* = 1. 

In two dimensions, direct congruences can be either translations or rotations. Indirect congruences can be either 
reflections or glide-reflections. A glide-reflection is a composition of a reflection in a line and a translation 
parallel to the line. 



14. Show that a glide-reflection has no fixed points. 
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15. Hence demonstrate that a reflection in the line B C is represented by z -> [b ‘ >u : h ] + b. 

16. If b b* = c c* = R 2 , show that a reflection in the line B C is represented by z -» b + c - . 

The comparative complexities of the previous two expressions show that the calculations become simpler when 
we assume that the circumcentre of a triangle ABC is the origin. This is explored more thoroughly in a later 
section of this chapter. 

17. Let ABC be a triangle, and P be a point in the plane. Let the reflections of B in B C, C A and A B be D, E 

and F, respectively. Prove that ^—-r =--—. [Euler’s formula for pedal triangles] 

[A B Cj 

As a special case of the above, we have the Simson line property: 

■ Let ABCbea triangle, and P be a point in the plane. Then the reflections of P in B C, C A and A B are collinear if 
and only if P lies on the circumcircle of ABC. Moreover, the orthocentre H lies on this line. [Dilated Simson line] 

Usually, these results are quoted when D, E and F are the feet of the perpendiculars from P, rather than reflec¬ 
tions of P. However, Euler’s formula is more elegant with this version, and the standard Simson line in general 
does not contain H. 


Triangle inequality 


One of the more rudimentary inequalities governing vectors (and thus complex numbers) is the triangle inequality. 

■ If a and b are nonzero complex numbers, then \ a + b\ < | a | + [ 6 [, with equality if and only if a is a positive real 
multiple of b. [Triangle inequality] 

This follows immediately from the following configuration, together with the notion that the shortest path 
between two points is a straight line. 



0 


18. Show that ( a - b) (c - d) + (a - d) (b - c) = (a - c) ( b - d ), where a , b, c, d e C. 

19. Hence prove that \ a — b\ | c - rf | + | a - d [ \b — c \ > \a — c\ \b -d\, with equality if and only if 

(a-b)(c-d) . . , 

-is a negative real. 
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20. Let ABC D be a quadrilateral. Show that ABCD + BCDA>ACBD, with equality if and only if 
ABCD is a convex cyclic quadrilateral. [Ptolemy’s inequality] 


Casey’s theorem 


The equality case of Ptolemy’s inequality can be regarded as a special case of Casey’s theorem. 

■ Let T be a circle, and 9j, 9 2 , Q 3 and V 4 be four circles tangent to T at P t , P 2 , P 3 and P 4 , respectively. The chords 
P 1 P 3 and P 2 Pa intersect inside L For each pair of circles 9, and 9 j, we let d(i, j) denote the length of the common 
outer tangents if 9, and 9 j are both on the same side of T, or the length of the common inner tangents if they lie on 
opposite sides of T. Then we have d( 1, 3)-d(2, 4) = d(l, 2)-d{3, 4) + d( 2, 3)-r/(4, 1). [Casey’s theorem] 

The following exercise demonstrates how the latter can be inferred from the former using some basic 
trigonometry. Firstly, we consider a circle tangent externally to T to have positive radius, and a circle tangent 
internally to T to have negative radius. T itself is considered to have negative radius. 



> 


Erecting cones and ‘anticones’ on the circles with positive and negative radii, respectively, gives the diagram 
shown above. 

21. If V 1 and 9 2 are two circles with radii r\ and r 2 and centres 0\ and On, respectively, then show that 
dr = 0\ Or - (j\ - r 2 ) 1 , where d is the length of the common outer tangents. 

The value of d is dependent only on the positions of the centres and difference between the radii. This means we 
can fix the centres and uniformly increase the radii (using the sign convention described above) of all five circles 
in the problem by the same amount, without changing the values of d(i, j) or affecting the tangency of the circles. 
(In the three-dimensional diagram, this is equivalent to moving the horizontal reference plane upwards or 
downwards.) So, we can assume without loss of generality that T is a single point (circle of zero radius) through 
which each 9, passes. This greatly simplifies the analysis. 

■ Let P be a point, and 9i, 92 , 93 and 94 be four circles passing through P with centres 0\, O 2 , O 3 and O 4 , 

respectively. The V-shaped line 0\ P O 3 separates the plane into two regions; 0 2 and O 4 lie in opposite regions. For 
each pair of circles 9 ; and 9 j, we let d(i, j) denote the length of the common outer tangents. Then we have 
d( 1, 3)-d{2, 4) = d( 1, 2)-d{3, 4) + d( 2, 3) -d{4, 1). [Simplified Casey’s theorem] 
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By proving the simplification of Casey’s theorem, we will therefore implicitly prove the original theorem. 

22. In the above problem, let Vi and Q 2 have radii r\ and ri, respectively. Show that d( 1, 2 ) 2 = 2 r\ r 2 (l - cos 60, 

A 

where 9 = 0\ P O^- 

23. Hence show that d( 1, 3) -d{2, 4) and the other terms in the simplified Casey’s theorem are unaffected when 
r \, ri, r 3 and r 4 are simultaneously replaced with their geometric mean. 

24. Hence prove the simplified Casey’s theorem. 

Casey’s theorem, like Ptolemy’s theorem, has a converse. If we have four (directed) circles and 
d( 1, 3)-d(2, 4) = d( \, 2)-d(3, 4) + d(2, 3)-d(4, 1), then there exists a fifth circle tangent to all four circles. This is 
the basis of the shortest known proof of Feuerbach’s theorem, demonstrating that there is a circle (the nine-point 
circle) tangent to the incircle and three excircles of a generic triangle. 
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Solutions 


1. 

2 . 


Suppose lAOB = 6. Then we have [A 0 B\ = ^ | a | | b | sin 6 = ^ (a x b). 

[A B C] = [B 0 A] + [C OB] + [A O C] = \ {bxa + cxb + axe). Using the formula for cross product, this 


equals 


\{ab* 


■ba* + be* -cb* +ca* 


■ ac*) = - i det 

4 


I 1 1 \ 

a b c 

, a* b * c* , 


3. Assume, without loss of generality, that the triangle is labelled anticlockwise. Let x, y and z be the centres 
of the equilateral triangles erected on B C, C A and A B, respectively. We have x = (b + c - cj c - to 1 b), 

where a> = e* ,n is a cube root of unity. This gives us x - y = - [a(u - 1) + b{\ - a> 2 '] + c(to - 1)). The 

symmetrical expression means that (x - y) = aXz - x), which is a sufficient condition for the triangle to be 
equilateral. 

4. We translate the configuration so that B is the origin, and A and C are represented by complex numbers 

a - b and c - b, respectively. Hence, we have lABC = arg(a - b) - arg(c - b), as required. The final part of 
the proof, namely that this also equals arg(-^), follows from the prosthaphaeretic property of the arg() 
function. 


5. L AB C = LAD C (mod n) <=> argf-^) = arg(-^) (mod n) <^=> (a b){c d) e [R. 

6. | z - p | = r <=> (z - p) (z* - p*) = r 2 z z* - p Z* - p* Z + p p* - r 2 = 0. 

7. This is a consequence of the ‘angle in the same segment’ theorem for concyclicity and the result of Question 

5. As b) ^ d ^ g R, it must be equal to its complex conjugate d .j . We then multiply throughout by 

the common denominator, giving the quartic equation 

(a - b) ( b* - c*) (c - d) (d* - a*) = (a* - b*) (b - c) (c* - d*) (d - a). 


8. Suppose D is variable and A, B and C are constants. The equation multiplies out to the form in Question 6, 
so is the condition that D lies on some circle. As the determinant vanishes whenever two columns are equal, 
D = A, D = B and D = C all satisfy this equation. Hence, it must be the circumcircle of ABC, and we are 
done. 


9. Multiply through by - i det 


1 
a 
a* 
a a* 


1 
b 
b * 
bb* 


1 

c 

c* 

cc* 


1 
d 
d * 
dd * 


and use Goucher’s theorem. This leads to the 


equivalent statement about signed areas [ABC] + [C D A] = [BCD] + [DAB], 


10. To obtain this transformation, we compose a translation by A O, rotation by 6 anticlockwise about O, and a 

translation by O A. They have the formulae z -» z — a, z -> ze‘ e and z -» z + a, respectively. Composing these 
in order yields z-> (z - a) e ,B + a = ze ,B + a( 1 - e ‘ fl ). 


11. As z -» a z + b is a linear transformation, it is closed under composition. Translations and rotations are of 
this form, ergo every rigid transformation is. 
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12 . 

13. 


14. 

15. 


16. 

17. 


18. 

19. 


20 . 

21 . 


22 . 


23. 


Again, we compose a rotation by 8 clockwise (z -> z e l(l ), a reflection in the real axis (z -» z*) and a rotation 
by 9 anticlockwise (z -> ze‘ s ). The composite transformation has rule z -» z* e 2,e . 


Composing a function of the form z~* az* + b with any linear function results in another function of the 
form z -> a z* + b. All indirect congruences can be built from a reflection in the real axis and a rigid 
transformation, so must also be of this form. 


Composing a glide-reflection with itself results in a translation, which clearly has no fixed points. 

The reflection must have the form z -> p z* + q, where p p* = 1, as we demonstrated earlier. Having B and C 
as fixed points (as the transformation does indeed) proves that it is not a glide reflection, and must be the 
reflection in the line B C. 


The reasoning is identical to the previous question. 


Without loss of generality, assume a a* = b b* = c c* - R 2 . Then d = b + c - —jrr , and e and / have similar 
forms. The determinant (proportional to the area) is given by £ (d e* - e d*), where £ denotes the cyclic 

eye eye 

sum interchanging a, b and c. This evaluates to 


det 


1 1 1 

d e f 

d’ e* /* 


= Yj \cib* — ba* + 
eye 


ba“ p p* ab“ p p’ 


)= Z (ab* — b a*)[\ - ) = det 

eye 


1 1 1 

a b c 
a* b* c* 


l - — I. Or, in other words, 

R 2 1 ’ 


[DEF] = [ABC]( 1-^f). 


Both sides of the equation expand to ab-bc + cd-da. 


As the modulus function is multiplicative, this is equivalent to 

\(a - b)(c - d) \ + \ (a - d)(b - c) \ > | ( a - c){b - d) |. This is the triangle inequality, so equality only 
holds when Ul b)u ll) is a positive real, or — /)>u d) is a negative real. 

(a-d) (b-c) v (b-c) (d-a) 6 


The real cross-ratio condition forces A, B, C and D to be concyclic. If A and C lie on the same side of B D, 
the cross-ratio would be positive; hence, chords A C and B D must intersect. 


Assume, without loss of generality, that r t > r 2 . Let one of the common outer tangents meet I'i at A and V 2 
at B. Further, let C be the point on the radius 0\ A such that AC = r 2 and O t C = r\ - r 2 . As C 0 2 B A is a 
rectangle, we have d = A B = C 0 2 (not carbon dioxide!). By applying Pythagoras’ theorem to the triangle 
0\ 0 2 C, we obtain C 0 2 2 = 0\ 0 2 - C 0\ 2 , each term of which is equal to d 2 = 0\ 0 2 2 - ( r\ - r 2 ) 2 . 


Using the formula from the previous question, we have d( 1, 2) 2 = 0\ 0 2 2 - (t'\ - r 2 ) 2 . The cosine rule gives 
us 0\ 0 2 2 = r\ 2 + r 2 - 2 r\ r 2 cos 8, and multiplying out yields ( r\ - r 2 ) 2 = r\ 2 + r 2 2 — 2r\ r 2 . The difference 
between these expressions is 2 r\ r 2 ( 1 - cos 9), as required. 


The previous question results in (d( 1, 3) -d(2, 4)) 2 = 4 r\ r 2 r 2 r\( 1 - cos 6) (1 - cos cp), where 9 and tf> are 
defined in the obvious way. This is unaffected when we replace each of r\,r 2 , r 2 and r 4 with 
r - V r i r 2 r 2 , as the product remains equal to r 4 . Hence, the value of d(l, 3)-d(2, 4) also remains 
invariant. By symmetry, so do the other terms. 


24. We can assume without loss of generality that /y = r 2 = r 2 = r 4 = r. The distance d(\, 2) = 0\ 0 2 , et cetera. 
Since 0\, 0 2 , 0 2 and O 4 lie on a circle by Ptolemy’s theorem, we are done. 
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Triangle geometry 

In this chapter, we consider the basic properties of a generic triangle ABC, and how the angles and distances 
between points are related. We explore parametrisations of the triangle using both trigonometry and complex 
numbers. In the process, we develop an arsenal of identities suitable for attacking both geometrical and 
trigonometrical problems, noting the interchangeability between the representations. 


Trigonometry 


The elementary trigonometric functions, namely sine and cosine, can be expressed in terms of the exponential 
function and vice-versa. 


■ sin(O) = ImiV "j = ' n '' —. [Definition of sine] 


■ cos (8) = Re|V = -— j —. [Definition of cosine] 

We can view this on the Argand plane, where we consider a point on the unit circle with Cartesian coordinates 
(cost?, sin0) and complex representation cos0+ i sin0= e‘ e . From applying Pythagoras’ theorem, we instantly 
obtain the famous identity sin 2 9 + cos 2 9=1. 



This is arguably the most reliable approach to proving trigonometric identities, as it is a simple matter of 
converting each expression to its exponential counterpart and verifying that both sides of the equation are indeed 
equal. However, it is preferable to derive a few identities first, as working with lots of exponentials can be 
laborious. Perhaps the most rudimentary trigonometric identities are the compound angle formulae. 

1. Prove that sin(F + 0) = sin 9 cos 0 + sin 0 cos 9. [Compound angle formula I] 

2. Similarly, prove that cos(0 + 0) = cos 9 cos 0 - sin 0 sin 9. [Compound angle formula II] 

With these, it is no longer necessary to rely on the exponential form for proving identities. Indeed, we can now 
avoid using complex numbers altogether. 

More sophisticated trigonometric functions can be expressed as ratios of sine and cosine. 

■ tan 0 = ; cot 8 = ; sec 6 = —; cosec 8 = — [Definitions of tangent, cotangent, secant and cosecant] 

cose sin B cos e sin e 
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This enables us to derive a compound angle formula for the tangent function. 

3. Hence prove that tan(0 + 0) = tjn w+tdn ^ [Compound angle formula III] 

[ tan 0 tan (!) 

As special cases of the above, where 8 = tf>, we obtain the double-angle formulae. 

■ sin(2 8 ) = 2 sin 8 cos 8 . [Double-angle formula I] 

■ cos(2 9) = cos 2 9 - sin 2 8. [Double-angle formula II] 


4. Prove further that cos(2 6) = 2 cos 2 0- 1 = 1-2 sin 2 8. [Extended double-angle formula] 

We can rearrange the above formula to obtain sin 2 8 and cos 2 8 in terms of cos 2 8. Hence, we can calculate the 
value of cos ^ from that of cos ^, for example. 

5. Hence deduce that cos (8 + 0) cos(0 -(/>) = - (cos 2 8 + cos 2 0) = cos 2 8 - sin 2 (/>. [Prosthaphaeresis] 

The compound angle formulae can be used recursively to derive expressions for three angles. 

6. Prove that sin(3 8) = 3 sin 8 - 4 sin 3 8. [Triple-angle formula] 

More generally, we can expand sin(0 + 0 + 0) to obtain sin(0 + 0) cos 0 + cos(0 + 0) sin 0, then apply the 
compound angle formulae again to each term. This results in the following expression: 

■ sin(0 + 0 + 0) = sin 9 cos 0 cos 0 + cos 8 sin 0 cos 0 + cos 8 cos 0 sin 0 - sin 9 sin 0 sin 0. [Compound angle formula 

IV] 

Let A, B and C be the angles of a triangle opposite sides of lengths a , b and c, respectively. R, r and s are the 
circumradius, inradius and semiperimeter, respectively. We can apply the trigonometric identities explored in the 
previous section to triangles, remembering that A + B + C = n, and thus sin(A + B) = sin C and 
cos(A + B) = -cos C. 

7. Prove that sin A sin B cos C + sin A cos B sin C + cos A sin B sin C - cos A cos B cos C- 1, and thus 
cot A + cot B + cot C - 1 = cosec A cosec B cosec C. 

8. Prove that tan A tan B tan C = tan A + tan B + tan C, and thus cot A cot B + cot B cot C + cot C cot A = 1. 
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Altitudes and orthocentre 


Consider the triangle ABC together with its orthocentre H. The altitudes meet B C, C A and A B at D, E and F, 
respectively. 



The orthocentric configuration exhibits a plethora of particularly interesting properties: 

• The reflections of the orthocentre, H, in each of the sides of the triangle land on the circumcircle. 

• If three points out of {A, B, C, D, E, F, H) are collinear, the remaining four are concyclic, and vice-versa. 

• If // is the orthocentre of ABC. then A is the orthocentre of H B C, et cetera. This is called an orthocentric 
quadrangle or perpendicularogram. All four triangles share the same nine-point circle, the centre of which is 
the barycentre of {A, B. C, H). 

• Due to cyclic quadrilaterals, we have lDHB=lAHE=lBCA. 


• Every inter-point distance has a simple expression in terms of the circumradius and trigonometric functions of 
the vertex angles: 



A 

B 

C 

D 

E F 

H 

A 

0 

2 R sin(C) 

2 R sin(fi) 

2R sin(B) sin(C) 

2 R cos(A) sin(C) 2 R cos(A) sin(fi) 

2Rcos(A) 

B 


0 

2 R sin(A) 

2Rcos(B) sin(C) 

2 R sin(A) sin(C) 2 R sin(A) cos (B) 

2 R cos {B) 

C 



0 

2 R sin(fi) cos(C) 

2 R sin(A) cos(C) 2 R sin(A) sin(B) 

2Rcos(C) 

D 




0 

2 R sin(C) cos(C) 2 R sin(B) cos {B) 

2Rcos(B) cos(C) 

E 





0 2 R sin(A) cos(A) 

2Rcos(A) cos(C) 

F 





0 

2Rcos(A) cos(B) 

H 






0 


• Every rectangular hyperbola passing through A, B and C also passes through H. The centre of the hyperbola 
lies on the nine-point circle of triangle ABC. 
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Tritangential circles 


We consider the four circles tangent to all three sides of a triangle, together with their centres and tangency 
points. One of these circles is enclosed by the triangle (the inscribed circle, or incircle, with incentre I), and the 
other three are called escribed circles, or excircles. Collectively, they are known as tritangential circles. 



As the two tangents from a single point to a circle are equal, we have that AP B = A Pq. Let l, m and n denote the 
distances A P B , B P c and C Pa, respectively. We have a = m + n, b = n + l and c = l + m. This enables us to 
deduce that A P B = A P c = l = s - a, where ,v = ^ (a + b + c) is the semiperimeter of the triangle ABC. We know 

that C Yi = C Y 2 , and that BY\ = B Q c , which gives us C B + B Q c = Q c A + A C = s, from which we can deduce 
that B Q c = s - a. 

m A P c = B Q c = s - a. [Distances to intouch and extouch points] 

This means that the line segments P c Q c and A B share a midpoint. 

Applying Pythagoras’ Theorem to triangle AP B I enables the distance AI 2 = A P B 2 + P B I 2 = (s - a) 2 + r 2 to be 
determined. Similarly, we have A I c 2 = (s - b ) 2 + r c 2 and AI A 2 = s 2 + r A 2 . 

9. Prove that r s - r A (s - a) = r B (s - b) = r c (s - c) = [AB C]. [Area of a triangle] 

10. Show that tan — = — = —. [Half-angle formula] 

2 s-a s 

11. Show that A1 = 4 R sin sin and thus r = 4 R sin ^ sin sin ^. [Inradius formula] 

^4 

12. Prove that s - b = rc tan —. [Complementary half-angle formula] 

With the inradius formula and half-angle formula, we obtain an expression for 5 - a; by symmetry, we also get 
s — b and s — c. The complementary half-angle formula gives us r A , r B and rc, and the bog-standard half-angle 
formula gives us s. These eight quantities are included in the table below so you can gape in awe at the elegant 
symmetries between the formulae. 
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Length 

Trigonometrical expression 

r 

4/?sin(|)sin(|)sin(|) 

s - a 

4/?cos(|)sin(!)sin(|) 

s-b 

4/?sin(|)cos(!)sm(!) 

s-c 

4/?sin(^)sin(|)cos(-f) 

fA 

4tf sin(4)cos(!)cos(-^) 

fB 

4tf cos(!)sin(|)cos(-f) 

rc 

4tf cos(4)cos(|)sin(-^) 

s 

4f?cos(^)cos(|)cos(|) 


13. Prove that [ABC] = V r r A r B r c = V .vf.v - a) (s - h) (s - c) = — ' L . [Beyond Heron’s formula] 

4 R 


14. Hence prove that cot A = b +c g . 

1 4[A B C] 

15. Show further that [ AB C] = ^ -\j a 2 b 2 c 2 sin A sin B sin C . [Gendler’s formula] 

16. Prove that sin A + sin B + sin C = 4 cos — cos - cos — = —. 

2 2 2 R 

Here are some miscellaneous properties of the tritangential circles: 

• If the lines CI Ic and A B intersect at R, then (C, /v 1 ; /, I c ) = -1 is a harmonic range. 

• The lines A P A , B P B and C Pc are concurrent at the Gergonne point Ge, whilst the lines AQ a , B Q b and C Qc 
are concurrent at the Nagel point Na. They are isotomic conjugates. The incentre, centroid, Spieker centre 

(incentre of the medial triangle) and Nagel point are collinear such that 3/G = 6GSp = 2Sp Na. This is 
known as the Nagel line, and is not unlike the Euler line. 

• Feuerbach’s theorem states that the incircle and three excircles are tangent to the nine-point circle. 

• The excentral triangle I A 1 B I c has orthocentre I. Its nine-point circle is the circumcircle of the reference 
triangle. 


Brocard points 


The first Brocard point PI is positioned such that Lil R C = Lll C A = L11A B = oj, where to is known as the 
Brocard angle. The second Brocard point PI' is its isogonal conjugate, where LPl'BA = LPl'CB=LPl'AC = u). 
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17. Prove that (cot u> - cot A) (cot to - cot B) (cot to - cot C) = cosec A cosec B cosec C. 


18. Hence show that cot to is a root of the cubic equation 

x 3 - (cot A + cot B + cot C) x 2 + x - (cot A + cot B + cot C) = 0. 


Let be the circle through A and B tangent to BC, and define V BC and r c A similarly. £1 lies on the 
intersection of the three circles. The other triple intersections of these three circles are the two circular points at 
infinity, which correspond to the imaginary roots of the cubic equation. 

19. Show that the above equation has only one real root, and thus cot oj = cot A + cot B + cot C. [Brocard angle 
formula] 


Now that we have this expression for cot cj, we can derive further identities: 


20 . 


Prove that tan to = 


sin A sin B sin C 
1 +cos A cos B cos C 


xj2 I u2 | r 2 

21. Prove that cot to =-. 

4[A B C] 

22. Hence show that to < - , with equality if and only if A B C is equilateral. 

23. Let P be a point interior to a scalene triangle ABC. Prove that one of the angles lPAB,lPBC and 
LP C A must be less than ^. [Adapted from IMO 1991, Question 5] 


R , r and s 


When evaluating (squared) distances between triangle centres, one often finds a symmetric polynomial in the side 
lengths (a, b and c). It is convenient to convert this into an expression in the circumradius K. inradius r and 
semiperimeter s. 

24. Using the formulae [ABC] = rs = = V s(s - a) (s -b)(s — c) , derive expressions for a + b + c, 

4 R 

ab + bc + ca and a b c in terms of R, r and s. 

Due to Newton’s theorem of symmetric polynomials, it is possible to express any symmetric polynomial in the 
side lengths in terms of these elementary symmetric polynomials, and thus in terms of R, r and s. 
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Symmetric polynomial 

R, r and s 

a + b + c 

2s 

ab+ac+bc 

r 2 +4rR + s 2 

abc 

4 rRs 

a 2 + b 2 + c 2 

-2 r 2 - 8 rR + 2s 2 

a 3 + b 3 + c 3 

2s{-3r 1 -6rR + s 2 ) 


When dealing with inequalities in the side lengths of a triangle, it is most convenient to convert it into an 
inequality in l = s - a, m = s - b and n = s - c. The triangle inequality is equivalent to /, m and n being positive 
reals. Symmetric polynomials in /, m and n can similarly be converted into polynomials in R, r and 5 using this 
method. 



Symmetric polynomial 

R, r and s 

l + m + n 


lm + ln + m n 

r 2 + 4rR 

1 m n 

r 2 s 

l 2 + m 2 + n 2 

-2r 2 -8rR + s 2 

1 3 + m 3 + n 3 

i 3 - 12 rRs 


25. Prove that, for every triangle, s 2 > 16 R r - 5 r 2 . [One half of Gerretsen’s inequality] 

26. Similarly prove that s 2 < 4 R 2 + 4 R r + 3 r 2 . [Other half of Gerretsen’s inequality] 

We can go further and find the tightest bounds possible for s 2 given R and r. As /, m, n are three real roots of the 
equation (x - l) (x - m) (x - n) = 0, it is a necessary and sufficient condition for the cubic 
x 2 - (I + m + n) x 2 + (/ m + m n + nl)x-lmn = 0 to have three (not necessarily distinct) real roots. Using the 
formulae for the elementary symmetric polynomials, we require the discriminant of 
x 3 - s x 2 + (r 2 + 4 R rj x - r 2 s x = 0 to be non-negative. The discriminant is given by 

18 r 2 s 2 (r 2 + 4 R r) - 4 r 2 s 4 + j 2 (r 2 + 4 R r'j - 4 (r 2 + 4 R r) 3 - 27 r 4 s 2 , which is a quadratic function in s 2 . Solving 
this inequality gives us the following necessary and sufficient condition on s 2 in terms of R and r. 
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m 2 R 2 + 10 R r - r 2 - 2 ^ R(R - 2 r) 3 < s 2 < 2 R 2 + 10 R r - i 2 + 2 y/ R(R - 2 r) 3 . 

The semiperimeter will fluctuate between these two values as the vertices of the triangle move around the circum- 
circle in Poncelet’s porism. 

27. Show that R>2r. [Euler’s inequality] 

28. Hence prove that sin A + sin B + sin C > sin 2 A + sin 2 B + sin 2 C. [Gendler’s inequality] 

29. Express cottu in terms of R, r and .v. 

ABC r 

30. Prove that cos A + cos B + cos C = 1 + 4 sin - sin - sin - = 1 + -. 

2 2 2 R 

31. Hence prove that cos A - cos B - cos C = 1 - 4 sin — cos -f cos — = 1 - —. 

F 2 2 2 R 

32. Show that 4 R = r A + r B + r c - r. 


Complex parametrisation of triangles 



Consider the reference triangle ABC. If we represent the vertices with complex numbers u 2 , v 2 and w 2 , 
respectively, such that u u* = v v* = w w* = R, and choose the signs of u, v and w such that u v lies on the arc A B 
containing C (and cyclic permutations thereof), then most of the useful aspects of the triangle have simple 
algebraic expressions: 
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Quantity 

Symbol 

Expression 

Circumradius 

R 

UU = V V = ww 

Inradius 

r 

^ (— V u — u v* — w u — uw* — wv* — v w*) — R 

Exradius opposite A 

fA 

1/* * * * * *\r> 

- y—vu —uv — w u — uw 1 wv +vw)+K 

Semiperimeter 

s 

t(v« -uv — w u + uw + wv -vw)) 

(Signed) area 

[ABC] 

^ i ( 'u 2 (v*)“ - u 2 (w*)“ - v 2 (m*) 2 + 



w 2 (uf + v 2 (w*) 2 -w 2 (v*) 2 )) 

Side length of BC 

a 

i (w v - v w*) 

Angle exponential 

e‘ A 

WV 

R 

Sine 

sin(A) 

i(wv —vw ) 

2 R 

Cosine 

cos(A) 

— wv -vw 

2 R 


There are also versions of these formulae expressed in linear factors, which can easily be multiplied and divided: 


Quantity 

Symbol 

Expression 

Inradius 

r 


Exradius opposite A 

rA 


Semiperimeter 

s 



s- a 

-i^K + i)K + i)(i-i) 

(Signed) area 

[ABC] 

{--JR 2 {- --){---)) 

v 4 ' U V' ' W W ' ' V W ' ' 

Side length of BC 

a 

iRj-J 

\ V W ' 

Angle exponential 

e iA 

w 

V 

Sine 

sin(A) 

(-;*(—)) 

Cosine 

cos(A) 

^ (----) 

l \ W V ' 


Many triangle centres have simple quadratic expressions in u, v and w. Others, such as the Feuerbach points, are 
more complicated: 
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Point 

Symbol 

Expression 

Vertex 

A 

u 2 

Vertex 

B 

V 2 

Vertex 

C 

w 2 

Circumcentre 

O 

0 

Centroid 

G 

(I ( m 2 + v 2 +w 2)) 

Nine-point centre 

T 

(i ( M 2 + V 2 + W 2 )) 

Orthocentre 

H 

U 2 + V 2 + w 2 

Altitude foot on BC 

D 


Incentre 

I 

-uv - uw - vw 

Intouch point on BC 

Pa 

(u+v) (m+w) (v+w) 

-—--- - uv - uw - vw 

2 u 

Excentre opposite A 

Ia 

uv + uw — vw 

Extouch point on BC 

Qa 

(u—v) ( w—u ) (v+w) , 

---- + UV + uw - VXV 

2 u 

Nagel point 

Na 

(u + v + w) 2 

Spieker centre 

Sp 

^ (u 2 + 11 V + U W + V 2 + V w + w 2 ) 

Feuerbach point 

F 

1 ( /?(m+V+w) ,2.2. 2\ 

- -7“T-T + U L + V z + W A ) 

2 ^ u+v +w ' 

Feuerbach point 
on excircle Ia 

F a 

1 ( R{u-v-w) ,2.2. 2\ 

- 1- 7 —;—r + M + v z + w z ) 

2 v u—v -w ' 


With these results, one can express any rational function of the side lengths and basic trigonometric functions as a 
rational function in u, v and w (and their conjugates). 

Firstly, however, it is a fulfilling exercise to derive the expressions in the table above. 

33. Show that u v is the midpoint of the arc A B containing C, and thus that -u v is the midpoint of the arc A B 
not containing C. 

34. Prove that the circumcircle of A B C is the nine-point circle of the excentric triangle I A hi h:- 

35. Hence show that -u v is the midpoint of I Iq, and thus u v is the midpoint of 1 A I B . 

36. Hence verify the expressions for I, I A , I B and I c . 

With the expressions for the circumcentre and centroid, one can derive expressions for the orthocentre and nine- 
point centre by using the Euler line. Similarly, the Nagel line enables one to extrapolate expressions for the 
Spieker centre and Nagel point based on those of the incentre and centroid. 

37. If J is the reflection of I in B C, show that J has representation v 2 + w 2 + v w + (v + w) —, and that 

J j _ (u+v) (v+w) ( w+u ) 
u 

38. Hence show that 2 r — -R(- + l) (- + 1) j - + 1), and thus derive the expression for r in the table. 
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39. Prove that OI 2 = R 2 -2Rr. [Euler’s identity] 

40. Prove that IT = \ R - r, and thus that the nine-point circle and incircle are tangent. [Feuerbach’s theorem] 

The combination of the two above formulae results in the inequality OI > 2 1 T, which means that / lies in the 
Apollonius disc of diameter G H, known as the Euler-Apollonius lollipop. Geoff Smith and Christopher Bradley 
discovered that the symmedian point and Gergonne point also reside in this disc. As T lies on the line segment 
GH, it must also inhabit the Euler-Apollonius lollipop. 



As a consequence of this, together with the Euler line and Nagel line properties, the Spieker centre lies in the disc 
of diameter O G and therefore outside the Euler-Apollonius lollipop. The lines IK and H Sp intersect at the 
symmedian point of the excentral triangle, known as the mittenpunkt M. M, G, Ge are collinear with ratio 
M G : G Ge = 1 : 2. The mittenpunkt must therefore reside in the disc of diameter O G. 

Indeed, the points shown inside the circles on the diagram above always remain in those circles. The red disc 
containing the Nagel point has diameter G L, where L is the de Longchamps point (reflection of H in O). 


Cyclic quadrilaterals 


Consider an arbitrary cyclic quadrilateral T U V W (labelled anticlockwise). We denote the lengths of edges T U , 
U V, V W and W U with a, h. c and d, respectively. The lengths of diagonals U W and T V are denoted with k 
and l, respectively. 



If we reflect T in the perpendicular bisector of U W to form 7", we obtain a new cyclic quadrilateral T ' U V W 
with the same area and side lengths as T U V W, but in a different order. The diagonal U W is unaffected, and 
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remains k. The diagonal T' V, however, now has length h, in general distinct from its original length /. 

Hence, we can consider T U V W to have three diagonal lengths: k and 1 as well as the invisible diagonal of 
length h obtainable by interchanging any two adjacent side lengths. 

41. Show that [T U V W] = Ulb+i ‘ iu = llM. [Parameshvara’s formula] 

4 R 4 R 


Parameshvara’s formula is very similar to the formula [AB C] = . The latter can be regarded as a special case 

of the former, where two of the vertices of the cyclic quadrilateral are coincident. Similarly, there is a generalisa¬ 
tion of Heron’s formula applicable to cyclic quadrilaterals: 

■ If a cyclic quadrilateral T U V W has edge lengths a, b, c, d and semiperimeter s = \ (a + b + c + d), then 
[T U V W\ = V (s - a)(s - b) (s - c)(s - d) . [Brahmagupta’s formula] 

Heron’s formula is a special case of Brahmagupta’s formula, which is in turn a special case of Bretschneider’s 
formula for convex quadrilaterals. 

■ [T U V W] = yj ((s - a) (s - b)(s - c) (s - d) - a b c d cos 2 ^-^p)). [Bretschneider’s formula] 

The term abed cos 2 (-^-) = - (a c + b d + k I) (a c + b d - k /), where k and I are the diagonal lengths. It is easy to 
see, by Ptolemy’s theorem, that this vanishes when the quadrilateral is cyclic. 

By definition, a cyclic quadrilateral is inscribed in a circle, so we can use a related parametrisation to that used for 
triangles. A cyclic quadrilateral T U V W is represented by complex numbers f 2 , u 2 , v 2 and w 2 , where 
11* = u u* = v v* = w w* = R. Unfortunately, the parametrisation is slightly less elegant for cyclic polygons with an 
even number of sides, as we cannot treat all vertices and edges equivalently. 



Notice that the midpoints of the outer arcs of T U, U V and V W are indeed represented by —t u, -u r and -v w, 
respectively, as one would imagine. However, due to annoying parity constraints, this forces the midpoint of the 
outer arc of W T to be positive tw. Hence, only the triangles T U V and U V W are correctly parametrised; the 
others have asymmetric (but equally simple) formulae associated with them. Nevertheless, we can now derive the 
aforementioned seven lengths and the area via Parameshvara’s formula. It is thus straightforward to verify 
Brahmagupta’s formula. 
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Quantity 

Symbol 

Expression 

Side length of TU 

a 

iR(- --) 

Side length of UV 

b 

iR (- --) 

Side length of VW 

c 

iR{- 

Side length of WT 

d 

V vp t / 

Diagonal UW 

k 

iR[± 

Diagonal TV 

l 

iR(±- v -) 

V v t > 

Invisible diagonal 

h 

iRia +±)(i 


s - a 



s-b 



s - c 

i^(l-I)G + l)(7 + i) 


s - d 


(Signed) area 

[TUVW] 

-\iR 2 _L) 

4 Vv t ' \\v u ' V t vp/\m v / 


42. Let the multitude M a be the line passing through the midpoint ofTU and perpendicular to V W. Define M/,, 
M c and Mj similarly. Prove that the four multitudes are concurrent at a point. [Anticentre property] 

This concurrency point Q has representation ^ (t 2 + u 2 + v 2 + w 2 ), and is known as the anticentre. It is obvious 
from this that the centroid of the four vertices is the midpoint of O Q. 

43. Let the diagonals T V and U W intersect at P. M and N are the midpoints of T V and U W, respectively. 
Prove that the anticentre Q is the orthocentre of triangle MNP. 

44. Let T' be the orthocentre of U V W, and define U',V' and W' similarly. Prove that T' U' V' W' is congruent 
to TUVW. 

45. Let I T be the incentre of U V W, and define Ip, I v and I w similarly. Prove that 1 T Ip I v I w is a rectangle. 

[Japanese theorem for cyclic quadrilaterals] 

46. Let r T be the inradius of U V W, and define r v , r v and r w similarly. Prove that r T + r v = r v + r w . 

47. Suppose we have a cyclic polygon A\ A 2 ... A n . We draw n — 3 non-intersecting lines between vertices to 
dissect the polygon into n - 2 triangles. Let the sum of the inradii of the triangles be <x. Prove that the value 
of cr is independent of the choice of lines drawn. [Japanese theorem for cyclic polygons] 
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Solutions 


1. We have sin 0 cos 0 + cos 0 sin 0 = 


(P e -e~ ie ){ t y*+ e - > *)^ !l +e- ie )(e>*-e-^) _ 


4 i 


4 i 


= sin(0 + 0). 


2. Similarly, we have cos 0 cos <p - sin 0 sin 0 = 


(, e‘ e +e ~‘ 8 ) (A *+e~ i 8 -«r‘ 8 ) (A *-e~‘*) _ 2( e i ‘ fl+ «+ < r i < 8+ <») 


= cos(0 + 0). 


3. Based on the previous two theorems, tariff! + </>) = 


sin(0+0) sin 0 cos 0+cos 0 sin 0 tan 0+tan 0 


cos (6+(f>) cos0cos0-sin#sin0 1-tan# tan 0 

where the numerator and denominator are both divided by cos 0 cos <f>.) 


. (The last step is 


4. This is a consequence of the double-angle formula for cosine and the identity sin 2 0 + cos 2 0=1. 

5. cos(0 + 0) cos (8 - 0) = (cos 8 cos 0 - sin 8 sin 0) (cos 8 cos 0 + sin 8 sin 0) = cos 2 8 cos 2 0 - sin 2 6 sin 2 0. 
Using the Pythagorean identity, we obtain 

(l - sin 2 0) cos 2 0 - (l - cos 2 0) sin 2 0 = cos 2 0 - sin 2 0 = (cos 2 0 - l) + (l - sin 2 0) = cos 2 0 + cos 2 0. 

6. sin 3 0 = sin 2 0 cos 0 + cos 2 0 sin 0 = 2 sin 0 cos 2 0 + cos 2 0 sin 0 - sin 3 0 = 3 (1 - sin 2 0) sin 0 - sin 3 0. This 
clearly expands to 3 sin 0 - 4 sin 3 0. 

7. If we let 0 = - A, 0 = ^ - B and 0 = f - C, then the expression becomes equal to sin(0 + 0 + 0) by the 

compound angle formula. As A + B + C = n, it must be the case that 0 + 0 + 0 = ^, the sine of which is 1. 
Dividing by sin A sin B sin C results in the desired equation. 

8. tan A tan B tan C = tan A tan B tan(A + B) = lanA+lallB tan t an II - tan A + ran 1:1 + '* 1MI = tan A + tan B + ran C. We Can divide by 

1-tan A tanfl 1 -tan A tan B 

tan A tan B tan C to obtain cot A cot B + cot B cot C + cot C cot A = 1. 


9. The area of triangle IB C is given by ^ r a, as r is the height of the triangle when orientated such that a is 
the base. By symmetry, [ ABC] = \ ra + ^ rb + \ rc = rs. The equivalence of rs to the other terms is 
evident from considering the similar triangles C1 Pa and C Iq Yi , which provides the identity — = —. 

10. The angle L1A B = —, as / is the intersection of the three angle bisectors. We have tan ( -) = — = ——. The 
second part of the formula comes from the identity r s = rA(s - a). 

11. Applying the sine rule to triangle A l B gives us = — 3 H r = - RsmC = 47 ? sin -, with the last step 

sin- sin(- +— ] cos- 2 

2 \2 2 ) 2 

utilising the double-angle formula. Rearranging results in A l = 4 R sin sin ~. The expression for r 
originates from considering the right-angled triangle AP C I and applying basic trigonometry. 

12. As the internal and external angle bisectors of A are perpendicular, we can quickly deduce that 

lQc AI C = ~ - at and thus L A I c Q c = ^. We then have tan ^ = — from applying basic trigonometry to 

Z L Z Z Vq 

the right-angled triangle. 

13. It is straightforward, from the expressions in the table together with the Sine Rule and double-angle 
formula, to verify that each term is equal to * sin A sin B sin C. 
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14. 


15. 


16. 

17. 


18. 


19. 


20 . 

21 . 

22 . 


23. 


24. 


25. 


26. 


A combination of the sine rule and cosine rule provides cot A =-= 

sin A 

r r» s-n a bc . . b 2 +C 2 -Cl 2 

[ABC] = —, so cot A = -. 

L 1 4 R ’ 4[A B C] 


cos 4 _ 2 R cos A _ «(A 2 +c 2 -a 2 ) 


abc 


. We also have 


[ABC] = \ a b sin C = ^ b c sin A = ^ c a sin B. The first expression is equal to the final, penultimate 

and antepenultimate expressions, so is trivially equal to their geometric mean. 


sin A + sin B + sin C = — + — + — = - = 4 cos — cos - cos -, using the final expression in the table. 

2R 2R 2R R 2 2 2 & 1 


Applying the sine rule to triangle A B Cl, we obtain 

Aft _ sin (B-(b) _ sin B cos oj- cos B sin oj 
B£l sin cj sin oj 

(cot to - cot B ) (cot oj - cot C) (cot to - cot A) sin A sin B sin C = 1, from which we obtain the desired identity 
by dividing throughout by sin A sin B sin C. 


= (cot to - cot B) sin B. The cyclic product tells us that 


Expanding the previous identity results in 

cot 3 ta - (cot A + cot i? + cot C) cot 2 aj + (cot A cot B + cot B cot C + cot C cot A) cot oj - cot A cot II cot C = cosec A cosec B cosec C. Using the 

identities proved in previous questions, this simplifies to 

cot 3 to - (cot A + cot B + cot C) cot 2 to + cot to - (cot A + cot B + cot C) = 0. 


The cubic factorises to (x - cot A - cot B - cot C ) (x - i) (x + i) = 0. As it is impossible for cot to to be 
imaginary, it must instead be cot A + cot B + cot C. 


tan to = 


l 

cot A+cot 5+cot C 


sin A sin B sin C 

sin A sin B cos C+sin A cos B sin C +cos A sin B sin C 


sin A sin B sin C 
1 +cos A cos B cos C 


cot to = cot A + cot B + cot C = 


a 2 +b 2 —c 2 
4[A B C] 


b 2 +c 2 —a 2 

+ 4 [ABC] 


c 2 +a 2 -b 2 

+ 4 [ABC] 


a 2 +b 2 +c 2 
4[A B C] ' 


Proving this is equivalent to showing that cot to > xftT, or a 2 + b 1 + c 2 > 4 V3~ [ABC]. Square both sides 
and apply Heron’s formula, giving the equivalent inequality 

a 4 + b 4 + c 4 + 2 a 1 b 2 + 2 b 2 c 2 + 2 c 2 a 1 > 3 (2 a 1 b 1 + 2 b 1 c 2 + 2 c 2 a 2 - a 4 - b 4 - c 4 ). Rearranging and 
dividing by 4 gives a 4 + b 4 + c 4 > a 2 b 2 + b 2 c 1 + c 2 a 2 , which (by Muirhead’s inequality) is true, with 
equality if and only if a = b = c. 

P must lie in either triangle ACIB, B CIC or C Cl A. Without loss of generality, assume the former. Then, we 
have LPAB<LClAB<~ b from the previous question. 

We clearly have a + b + c = 2s. As [A B C] = rs = , we obtain a b c = 4 R r s. By squaring Heron’s 

formula and dividing throughout by s, we get (.s - a) (s - b) (s - c) = r 2 s. Polynomial expansion results in 
s 2 - (a + b + c) s 2 + (a b + b c + c a) s - a b c = r 2 s. The other symmetric polynomials can be replaced by the 
expressions in R, r and s, yielding s 2 - 2 s 3 + (a b + bc + c a) s-4Rrs = r 2 s. Final manipulation and 
division by s culminates in the expression for the third symmetric polynomial, 
ab + b c + ca= s 2 + r 2 + 4 Rr. 


Schur’s inequality provides 1(1 - m) (1 - n) + m(m - n) (m - 1) + n(n - 1) (« - m) > 0. We can then expand to 
obtain / 3 + m 2 + n 3 + 3 l m n > l 2 m + m 2 l + m 2 n + n 1 m + n 2 l + l 2 n. Adding / 3 + m 2 + n 2 to each side of the 
equation yields 2 ( l 3 + m 3 + n 3 ) + 3 / m n > (l + m + n) (l 2 + m 2 + n 2 ). Replacing each of these symmetric 
polynomials with their R, r and s counterparts gives 2 (s 3 - 12 R r s) + 3 r 2 s > s(-2 r-8/ir + s 2 J. 
Rearranging, we obtain s 2 - 16 R r ,v + 5 r 2 .v > 0. Further rearrangement and division by 5 gives the required 
inequality, s 2 + 5 r 2 > 16 Rr. Equality occurs if and only if the triangle is equilateral. 


We can convert the expression into an inequality in /, m, n. Firstly, we derive the expressions r 2 = , 
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4 R 2 = 1/+ ”'j j”| ,> and 4Rr = lm + mn + nl-r 2 .By multiplying the inequality by 4 1 m n(l + m + n), 

we obtain the equivalent inequality 

4 / m nil + m + n) 2 < (1 + m) 2 (m + «) 2 (n + l) 2 + 41 m n((l m + m n + n /) (/ + m + n) + 21 m n). It is now 
helpful to apply the u v w method to express it as w 6 + (2 u v 2 - 12 « 3 ) w 3 + 9 n 2 v 4 > 0. This is a quadratic in 
w 3 , so we only need to check three cases by Tejs’ corollary. The third case only occurs when F w 2 + G- 0, 
or w 2 + u v 2 — 6 u 2 = 0. The expression is negative since u > v > w, so this cannot occur. Hence, we only 
need to consider when / = m or n = 0. In the latter, we have a degenerate triangle comprising three collinear 
points, and thus r = 0, 2 R = s; this satisfies the inequality. In the former case, the triangle is isosceles and 
the inequality reduces to 4 / 2 n(2 1 + n ) 3 < 4 l 2 (l + n) 4 + 41 2 «((/ 2 + 2 1 it) (21 + n) + 2 / 2 «). Expanding out 
gives the equivalent inequality 4 / 6 - 8 I s n + 41 4 n 2 > 0, or 4 / 4 (/ - n) 2 > 0. This is trivially true. 


27. With Muirhead’s inequality, it is trivial to verify that (/ m + m n + n /) (l + m + n) > 9 l m n. Expanding each 
term gives (r 2 + 4 R rj s >9 r 2 s, which simplifies to 4 R r s > 8 r 2 s. Dividing throughout by 4 rs yields the 
desired inequality. 

28. [O A B] = \ R 2 sin 2 C, so we have [AB C] = \ f? 2 (sin 2 A + sin 2 B + sin 2 Cj. Hence, the left-hand side of 

s ^ ^i 2 k s s 2 v s 

the inequality is equal to -, and the right-hand side is equal to ——-— 1 = —. So, we need to prove - > —-, 
which simplifies to Euler’s inequality, R > 2 r. 


29. 


cot to = 


g 2 +b 2 +c 2 

4[A B C] 


s 2 ~r 2 -4Rr 
2 rs 


30. cos A + cos B + cos C = 2 


b 2 +c 2 —a 2 _ „ ab 2 +ac 2 -a 3 _ (a 2 +b 2 +c 2 )(a+b+c)-2(a 3 +b 3 +c 3 ) 


eye 


2 be 


= Z 


eye 


2 abc 


2 abc 


. We can now apply the 


4s(s 2 -r 2 -4Rr)-4s(s 2 -3r 2 -6Rr) 2r 1 +2Rr 

formulae to convert this expression to--. Some cancellation results in ■ 


8 Rrs 

r ABC 

which further simplifies to 1 + -. We already have r = 4 R sin — sin — sin —, so we are done. 

R 2 2 2 


2 Rr 


31. The previous equation works for all triangles, and thus, due to the analytic continuity of sine and cosine, 
works for all angles such that A + B + C = 7t. If we use the angles A + 2 n, B - n and C -n, we obtain the 
equation cos A - cos B - cos C = 1 + sin + n) sin(-^ - sin(-^ - ~), the right-hand side of which 

simplifies to 1 - sin 4 cos ^ cos ^. 


32. Note that cos A + cos B + cos C = 1 + - and cos A - cos B - cos C = 1 - — by the previous two results. We 

R R 

add the first equation to the cyclic sum of the second equation, yielding 0 = 4 + -^ - 
Multiplying throughout by R and rearranging gives 4 R = r A + r H + r c - r. 

33. The magnitude of u v is the same as u 2 and v 2 , namely R. We also have 

2 arg(« v) = arg(n 2 v 2 ) = arg(z< 2 J + arg(v 2 ). Hence, it must be the bisector of one of the arcs, namely the arc 
containing C (as we have defined it as such). 

34. The angle L1 A I c is a right-angle. By symmetry, I must be the orthocentre of I A I B I c , and thus ABC is its 
orthic triangle. The circumcircle of the orthic triangle is the nine-point circle. 

35. —u v lies on the angle bisector I I c , due to the ‘equal arcs subtend equal angles’ property. As it lies on the 
nine-point circle of II A I B l c , it must be the Euler point (midpoint of / l (: ). The nine-point centre of the 
excentric triangle is O, the circumcentre of the reference triangle. It is also the barycentre of /, I A , I B and I c , 
so must be the midpoint of the line joining the midpoints of I I c with I A 1 B . This enables us to deduce that 

u v is indeed the midpoint of I A 1 B . 
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36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


44. 

45. 


46 . 


In complex coordinates, we have 

I A = (I A +I B + Ic) ~ Ub + Ic)= j (44 + Ib) ~ \ dfi + h:) + “ (7c + 40 = m v - v w + w u, thence it is trivial to 
find the representations of the other excentres and incentre. 


Using the formula for the reflection of a point in a chord, we have that J has representation 
r 2 

V 2 W VV 2 V 


9 9 v 2 vi ^(v* w*+w* u*+u* v*) 9 9 v 2 w w 2 v TT ~T~t i „ . • 

v + w +---- = v + w + v w + — + —. Hence, I J has representation 


V 2 + vv 2 - V w 


+ MV + VW + W «. Multiplying throughout by m yields the symmetric expression 


id v + v" u + v 2 vv + vv 2 v + u- u + u 2 w + 2 u v vv. This factorises to ( u + v) (v + w) (vv + u), so the original 

• . (u+v) (v+w) (w+u) 

expression is -—————-. 


Multiplying the expression for IJ by its complex conjugate gives 

4 r 2 = = r2 ^ + i) (i + + 1 ) (1 + T) (L + 1 )( 1 + L). T he square root of 

this is thus the distance 2 r. 

We have O I 2 = (uv + vw + w u) (u* v* + v* vv* + vv* u*) = R( 3 R + uw* + w u* + v u* + uv* + w v* + v vv*). 
Combined with the expression for r given in the table, this equals R(R - 2 / j, as required. 

1 T = ^ (« 2 + v 2 + w 2 ) + uv + vw + wu = - (u + v + w) 2 . It is obvious that the modulus is 

- (u + v + w) (u* + v* + vv*) = - (3 R + u vv* + w u* + v u* + u v* + w v* + v vv*) = 4 R - r. 

[T U V W] = [T U V] + [V W T] = ~ ^ = U ' h + L R hl ■ By Ptolemy’s theorem on the quadrilateral 

T' U V W (where T' is the reflection of T in the perpendicular bisector of U W), we have ab + cd = l, 
giving us Parameshvara’s formula. 

The maltitude passes through 4 (u 2 + r 2 ), and travels parallel to the vector 4 (v 2 + vv 2 ). It is clear that 

- (t 2 + u 2 + v 2 + w 2 ) lies on this maltitude, and thus all four maltitudes by symmetry. 

If we consider the other two maltitudes (from the midpoint of each diagonal perpendicular to the other 
diagonal), they must also pass through Q. By definition, they also pass through the orthocentre of M N P, so 
Q must be this orthocentre. 

7” has representation u 2 + v 2 + w 2 . This is the reflection of T in the anticentre Q. Hence, T' U' V' W' is 
congruent and homothetic to the original with Q as the centre of similitude. 


Let l v be the incentre of V W T, et cetera. We have I u = -vw + tw + v t, Ij = -u v - v vv - w u, and 
Iw = ~t u - u v - v t. We wish to prove that LI T I n I v = -. which is equivalent to Iv Iv = m " +l> being 

2 If—Iu ( t+u ) (w+v) 

purely imaginary. We know that t, u, v and w have equal modulus, so must be concyclic. Hence, is 

real, and we only need to prove that — = 111 ,>( " + ' ] is imaginary. The numerator is t* w - w* t, which is 

J r w+t (w+r)(w+r) ° J 

equal to the negative of its conjugate and is therefore imaginary. Similarly, the denominator is equal to its 
conjugate and therefore real. Hence, we are done and LI T I v I v = ^; by symmetry, 1 T I v I v I w is a rectangle. 


For any point P in the plane of a rectangle ABC D, we have A P 1 + C P 2 = B P 2 + D P 2 . This can be derived 
from assuming P is the origin and orienting the rectangle parallel to the coordinate axes and using Cartesian 
coordinates. Applying this to the point O (circumcentre of T U V W) and the rectangle of incentres, we 
obtain O Ij 2 + O ly 2 = O Ip 2 + O l\y 2 . We can determine each of these squared distances from Euler’s 
formula, obtaining the equation ( R 2 -2 R r T ) + ( R 2 -2Rr v ) = ( R 2 -2R rp'j + ( R 2 -2 R r w ). Cancelling 
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terms and dividing throughout by R yields the desired equation. 


47. This is true for n = 4, by the previous question. We prove the general case by induction on n, assuming it is 
true for all n < k, and proving it for n = k + I. Note that for every triangulation, there must be at least one 
triangle with three adjacent vertices. Suppose we have a triangulation where A\ A 2 A 3 is one of the triangles. 
Then we can ‘re-triangulate’ the k-gon A 3 A 4 ... Ai ;+i Ai such that Ai A 3 A 4 is also a triangle. Now, re¬ 
triangulate the cyclic quadrilateral Ai A 2 A 3 A 4 , so that A 2 A 3 A 4 is a triangle. Repeating this process, we can 
ensure that A,-_i A,- A,-+i is a triangle for any i (with subscripts considered modulo k + 1). Arbitrarily re¬ 
triangulating the k-gon Aj+i A ,+2 • ■ • A ,-_2 A,_i gives any possible triangulation of the (k + l)-gon. As we did 
not affect <x during any of the re-triangulations of the k- gons, <x has remained constant throughout the whole 
process. 
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Areal coordinates 

A large quantity of problems are concerned with a triangle ABC , known as the reference triangle. It is particu¬ 
larly useful, in these instances, to apply a special type of projective homogeneous coordinates, namely areal 
coordinates. The vertices A, B and C are given by the coordinates (1,0, 0), (0, 1, 0) and (0, 0, 1), respectively. 
The line at infinity is given by x + y + z = 0. This exploits the symmetry of the triangle in a way that Cartesian 
coordinates do not. 


Areas and lines 


We have already defined areal coordinates as a special case of projective homogeneous coordinates. However, 
there are several other equivalent definitions explored later, explaining the synonyms ‘areal’ and ‘barycentric’. 



We can normalise the areal coordinates (x, y, z) in the plane (not on the line at infinity) by assuming that 
x + y + z= 1. To convert unnormalised areals into their normalised counterparts, simply apply the map 

( x , y, z) ->-. 

x+y+z 


1. If the point P is represented by normalised areal coordinates (x, y, z), prove that x = • (Hint: consider 

the volumes of tetrahedra RABC and R P B C.) 

This gives us one definition of areal coordinates, namely the ratio between the areas [P B C], [PC A] and [P A B] 
If the triangle [A B C] has unit area, then the areas of these triangles are equal to the normalised areal coordinates 
This is encapsulated by the following diagram from Tom Lovering’s excellent introduction to areal coordinates 
(available at http://www.bmoc.maths.org/home/areals.pdf). 
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As areal coordinates can be defined in terms of ratios of areas of triangles (which are unchanged by affine transfor¬ 
mations), the areal coordinates of a point remain invariant when an affine transformation is applied. 

2. Deduce that the lines B C, C A and A B correspond to the equations x = 0, y = 0 and z = 0, respectively. 

3. Show that the centroid, G, has normalised areal coordinates , -. -). 

' 3 ’ 3’ 3/ 


4. 


Let the points P , Q and S be represented by normalised areal coordinates (xi, y 1 , zi), (X 2 , yi, Z 2 ) and 



' Xl 

*2 

*3 ’ 

(* 3 , y 3 , 23 ), respectively. Show that det 

y\ 

yi 

y3 


, Zl 

Zl 

Z3 > 


[PgS] 
[ABC] ' 


Ceva’s theorem and cevians 

5. Show that the equation of the line A P, where P = (x 1 , yu z 1 ), is given by yi z= ,"1 y. Hence find the 
coordinates of the intersection point, L = A P f] B C. 

The line A P is known as a cevian through A, named after Ceva’s theorem. This can easily be proved using areal 
coordinates. 



6 . Let L, M and N lie on sides BC, C A and A B, respectively, of a triangle ABC. Show that AL, B M and 
C N are concurrent if and only if ■ — = 1. [Ceva’s theorem] 

LC MA NB 








122 | Mathematical Olympiad Dark Arts 


7. Suppose we have a point P, and draw the three cevians through it to meet the sides B C, C A and A B at L, 

M and N, respectively. Reflect L in the perpendicular bisector of B C to obtain L\ and define M' and N' 
similarly. Prove that A L \ B M' and C N ' are concurrent. [Existence of isotomic conjugates] 

8 . Let L, M and N lie on sides BC, CA and A B, respectively, of a triangle ABC. Show that AL, B M and 

C N are concurrent if and only if--= 1. [Trigonometric Leva s theorem] 

J sin LCAL sin LABM sinLBC N 

9. Suppose we have a point P, and draw the three cevians through it. Reflect the cevian through A in the line 
A I, and repeat for the other two cevians. Prove that these three new lines are concurrent. [Existence of 
isogonal conjugates] 

In unnormalised areal coordinates, the isotomic conjugate of (x, y, z) is given by (1/x, 1 /y, 1 /z) and the isogonal 
conjugate of (x, y, z) is (a 2 /x, b 2 j y, c 2 j z). The symmedian point (intersection of the reflections of the medians 
in the corresponding angle bisectors) is defined as the isogonal conjugate of the centroid, giving it unnormalised 
areal coordinates (a 2 , b 2 , c 2 ). Here are the unnormalised coordinates of common triangle centres: 


Point 

Unnormalised areal coordinates of point 


X 

y 

z 

Vertex A 

1 

0 

0 

Centroid 

1 

l 

1 

Incentre 

a 

b 

C 

Excentre opposite A 

-a 

b 

c 

Nagel point 

s - a 

s- b 

s - c 

Gergonne point 

fA 

r B 

rc 

Symmedian point 

a 2 

b 2 

c 2 

Circumcentre 

sin(2 A) 

sin (2 B ) 

sin(2 C) 

Orthocentre 

tan(A) 

tan (B) 

tan(C) 

Nine-point centre 

sin(2 B ) + sin(2 C) 

sin(2 A) + sin(2 C) 

sin(2 A) + sin(2fi) 

First Brocard point 

1 

b 2 

1 

c 2 

1 

a 2 

Second Brocard point 

1 

c 2 

1 

a 2 

1 

b 2 


The circumcentre, orthocentre and nine-point centre also have non-trigonometric forms (expressed in terms of a 2 , 
b 2 and c 2 alone). However, they are even more complicated than the trigonometrical expressions here, so their 
practical use is unrecommended. If these points are involved, it is better to use the parametrisation involving 
complex numbers. 

By analogy with the first and second Brocard points, the triangle centre with areal coordinates j is 

known as the third Brocard point. Apart from being the isotomic conjugate of the symmedian point, it is com¬ 
pletely boring. 

10. Using the formulae for isogonal conjugates, prove that the incentre and excentres indeed have the 
coordinates shown in the table. 

11. Prove that the orthocentre has normalised coordinates (cot B cot C, cot C cot A, cot A cot B). 

12. Show that the circumcentre and orthocentre are isogonal conjugates. 
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Barycentres and Huygens-Steiner 


Another interpretation of the point with coordinates ( x , y, z) is the barycentre (centre of mass) of the system 
where masses of x, y and z are placed at the vertices A, B and C, respectively. Hence, areal coordinates are 
occasionally known as barycentric coordinates. 

13. Suppose we have a set S of masses in the plane, with total mass m t + m 2 + • • • + m„ = 1. The mass m l is 
located at the point A,, and the barycentre is denoted P. For any point Q in the plane, define the weighted 
mean square distance [^(S 1 , Q) = ’£ I m i(Aj Q) 2 . Prove that P Q 2 = i'vK.S', Q) - [VI(.S', P ). [Huygens-Steiner 
theorem] 

This theorem is named after Jakob Steiner and Christiaan Huygens. The latter is famous for inventing the pendu¬ 
lum clock, proposing a wave theory of light, and discovering Titan (the largest of Saturn’s many moons) with a 
telescope he built. 

There are equivalent formulations of the Huygens-Steiner theorem in mechanics (the parallel-axis theorem) and 
statistics (o- 2 = E(X 2 ) - E(X) 2 ). 



As a corollary of this theorem, the centroid of a set of points minimises the sum of squared distances to each of 
those points. This is demonstrated by the density plot of the function A P 2 + B P 2 + C P 2 , which has a global 
minimum at P = G. 

By repeated application of Huygens-Steiner, we can determine the weighted mean square distance between two 
sets. The ordinary version can be regarded as the case where one of the sets has a single element. 

■ Suppose we have two sets, Si and S2, each with unit total mass. Every mass m,- e Si is located at the point A,; every 
mass tij e S2 is located at the point Bj. The barycentres of Si and S2 are denoted P 1 and P 2 , respectively. We define 

the weighted mean square distance M(Si, S 2 ) = ^V; n j(A{ Bj ) 2 . Then we have 

Pi P 2 + 1 V 1 (S1. Pi) + WS 2 , P 2 ) = M(Si, S2). [Generalised Huygens-Steiner theorem] 

It is particularly relevant to our exploration of barycentric coordinates to consider Huygens-Steiner where n = 3 
and the masses are positioned at the vertices of the reference triangle ABC. 
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14. If O is the circumcentre of the reference triangle ABC and P has normalised areal coordinates (x, y, z), 
show that O P 2 = R 2 -[xAP 2 + y B P 2 + zC P 2 ). 

15. Hence show that O G 2 = R 2 - ^ (cr + h 1 + c 2 ). [Circumcentre-centroid distance] 

16. Prove that O I 2 = R 2 - 2 R r. [Euler’s formula] 

17. For every n > 3, determine all the configurations of n distinct points X\, Xo, ..., X„ in the plane, with the 

property that for any pair of distinct points X h Xj there exists a permutation cr of the integers (1,2, ...,«] 
such that d(Xj, X *.) = d(Xj, X^J for all k e (1, 2, where d(A, B) denotes the distance between A and 

B. [RMM 2011, Question 5, Alexander (formerly, at the time he composed the problem, known as 
Luke) Betts] 



18. A quadrilateral ABC D is drawn in the plane. Show that the midpoints of the four sides, midpoints of the 
two diagonals, intersections of opposite sides, and intersection of the diagonals all lie on a single conic. 
Show further that this conic cannot be a parabola. [Nine-point conic] 


Distance geometry 


19. In a triangle ABC (with the side lengths labelled in the usual way), we choose a point D on B C such that 
BD = m, CD = n and AD = d. Prove that man + d ad = bmb + cnc. [Stewart’s theorem] 
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A 



Stewart’s theorem is particularly attractive, as it is defined solely in terms of distances and nothing else. It can be 
derived through simple application of the cosine rule; however, the derivation using the Huygens-Steiner theorem 
remains firmly within the realms of distance geometry. The statement of Stewart’s theorem can be remembered 
with the mnemonic ‘a man and his dad put a bomb in the sink’. 

If m = n, then Stewart’s theorem reduces to a special case called Apollonius ’ theorem. 

■ Suppose we have a triangle ABC, where M is the midpoint of B C. Then A M 2 = ^b 2 + \ c 2 a 2 . [Apollonius’ 
theorem] 

A much more impressive theorem in distance geometry is that of the Cayley-Menger determinant. If 
A\ A 2 ... A n A n+ 1 is a «-simplex with volume V, then the following identity applies. 


(0 

1 

1 

1 

1 

1 

0 

AiA 2 2 

A1A3 2 

••• Ai A„ + i 2 

1 

A 2 Aj 2 

0 

a 2 a 3 2 

■■■ A 2 A n+1 2 

1 

A3 A 2 

A3 a 2 2 

0 

■■■ a 3 a „ +1 2 

, 1 

A„+ 1 Ai~ 

A„+1 A 2 ‘ 

A„+1 A3 2 

0 , 


. [Cayley-Menger determinant] 


For n = 2, this is equivalent to Heron’s formula. For n = 3, this is known as Tartaglia’s formula (remember that 
angry guy who solved the cubic equation?) for the volume of a tetrahedron. Equating this to zero gives an equa¬ 
tion relating the squared distances between four coplanar points, which can itself be considered to be a generalisa¬ 
tion of Stewart’s theorem. 


20. Prove that R 2 = 


_ a 2 b 2 c 2 _ 

( a+b+c ) ( a+b—c ) ( a—b+c) (— a+b+c) 


Circles in areal coordinates 


21. Let ABC be the reference triangle, with side lengths BC-a, C A-b and AB = c. Show that if two points 
have normalised areal coordinates P = (x\, y\, <q) and Q = (x 2 , y 2 , z 2 ), then P Q 2 = - a 2 v w - b 2 w u - c 2 uv, 
where u = x\ - x-i, v = yi - yo and w = zi - Zi■ [Areal distance formula] 

By considering a circle to be the locus of points of a particular distance from a given point, we obtain the general 
formula for a circle. 

■ A circle has the equation a 2 y z + b 2 z x + c 2 x y + (x + y + z) (A x + B y + C z) = 0 , where A, B and C are constants. 

[Equation of a circle] 

The (x + y + z) bracket is included to make the equation homogeneous, so that it is compatible with unnormalised 
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coordinates. From this, we obtain the equation of a circle through three given points. 

■ The circle through points P = (xi, yi, zi), Q = (x 2 , y 2 , z 2 ) and R = (x 3 , y 3 , z 3 ) is given by equating the determinant of 
the following matrix to zero: [Concyclicity condition] 

' x(x + y + z) xi(xi + yi + zi) x 2 (x 2 + y 2 + Z 2 ) x 3 (x 3 + y 3 + z 3 ) 

y(x + y + z) yi(x 3 + yi + zt) y 2 (x 2 + y 2 + z 2 ) y 3 (x 3 + y 3 + z 3 ) 

z(x + y + z) ziOt + yi+zt) z 2 (x 2 + y 2 + Z 2 ) z 3 (x 3 + y 3 + z 3 ) 

a 2 yz + b 2 zx + c 2 xy a 2 y x Z\ + b 2 Z\ x x + c 2 x-i yi a 2 y 2 z 2 + b 2 z 2 x 2 + c 2 x 2 y 2 a 2 y 3 z 3 + b 2 z 3 x 3 + c 2 x 3 y 3 t 

This is itself a special case of a variant of Goucher’s theorem applicable to areal coordinates. 


■ Let S = (X4, >'4, 74), P = {x\, yi, Zi), Q = ( x 2 , y 2 , Z2 ) and R = (x 3 , y 3 , 73). Then the determinant of the following 
matrix: 

44(44 + 34 + Z4) Xitn + fi+Zi) x 2 ( x 2 + y 2 + z 2 ) 4:3(4:3 + 313 + 23) 

34(4:4 + 34+24) _V| (+! + Vi + 2]) 32(4:2 + 32 + 22) 32(+3 + 3.3 + 33) 

24(44 + 34+24) Zl( 4 l+ 3 l+ 2 l) 22(42 + 32 + 22) 23(43 + 33+23) 

a 2 34 2 4 + b 2 zt 44 + c 2 44 34 a 2 31 2i + b 2 Zi x t + c 2 4, 33 a 2 y 2 z 2 + b 2 z 2 x 2 + c 2 4 2 y 2 a 2 33 23 + 6 2 z 3 43 + c 2 43 33 
, , , 9 [Pgs] 

is equal to (x\ + yi + zi ) 2 (x 2 + y 2 + z 2 ) (x 3 + y 3 + z 3 ) (x 4 + y 4 + za) Aflc Power( 5 , P Q R). [Goucher s theorem 
for areal coordinates] 

22. Show that the power of a point P = (x, y, z) (in normalised areal coordinates) with respect to the 
circumcircle of the reference triangle ABC is given by Power (P, AB C) = -a 2 y z - b 2 zx - c 2 x y. [Power 
with respect to circumcircle] 

We can combine this with Huygens-Steiner to yield the following equation: 

■ R 2 - O P 2 = a 2 y z + b 2 zx + c 2 x y = x A P 2 + y B P 2 + zC P 2 , where P = ( x, y, z) in normalised areals. 

This enables us to calculate the distances between the circumcentre and several other points. 

23. Flence show that R 2 - O I 2 = 2 Rr. [Euler’s formula] 

24. Prove similarly that R 2 - OI a 2 = -2 R r A . [Excentral analogue of Euler’s formula] 

25. Flence prove that O I 2 + OI A 2 + O Ig 2 + O Ic 2 = 12 R 2 . 

26. Demonstrate also that R 2 - OH 2 = 4[A B C] cot A cot B cot C = 8 R 2 cos A cos B cos C. [Power of the 
orthocentre] 

As areal coordinates are projective homogeneous coordinates, conics have the general form 
Ax 2 +By 2 + Cz 2 +Dyz + Ezx + F x y = 0, where A, B , C, D, E and F are constants. It is easy to see that a 
circle is thus a special case of a conic. 

27. Let P be chosen randomly in the interior of triangle ABC, such that equal areas have equal probabilities of 
containing P. Find the probability that [A B P] > y/ [B C P] + VI C A P] . [Adapted from RMM 2008] 


Barycentric combinations of circles 


The theory of barycentric combinations of circles is a relatively recent one, emerging from the following problem: 

28. Let Ti, Y 2 and r 3 each pass through fixed points A and B. Let a line / pass through B and meet the circles 
again at Pi, Pi and P 3 . Prove that the ratio Pi Pi : Pi P\ is independent of I. [Adapted from APMO 2012, 
Question 4] 

The original problem was solved in many unique ways by members of the British IMO squad, using techniques 
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such as spiral similarity, vectors, coordinates, inversion, trigonometry and similar triangles. 

Let T be a circle with centre O and radius r. We uniformly distribute a unit mass around the circumference of L 
By applying the Huygens-Steiner theorem, we can deduce that 

M(I\ P) = M(r, O) + O P 2 = r 2 + O P 2 = 2 r 2 + Power (P, F). 



Suppose we have n circles in the plane, {Ti, ..., r„), each considered to have a mass m, such that 

m\ + W 2 + ... + m n = 1. We then define an ‘average circle’ F such that Power! P, F) = £(/«,■ Power! P. F,j). This is 

possible by considering the equation for the power of a point in either Cartesian or areal coordinates. 

29. Prove that the centre of F is the weighted centroid G = £(«; Of), where O, is the centre of T,. [Barycentric 
combination of circles] 

It is now possible to determine the radius of F. 

■ R 2 = -Power! G, F) = £m ; (r ; 2 - G O 2 ), where the circle r, has centre O, and radius [Radius of barycentric 
circle] 

Certain barycentric combinations of circles are interesting. 

30. Let ABC be a triangle with circumcircle r. T,,, T /; and T c are the reflections of Y in the sides B C, CA and 
A B, respectively. Show that the average of the three circles r B and T c is the Euler-Apollonius lollipop. 



31. What is the average of the four circles T, r B , r c ? 

If we have four points A, B, C, D which do not form a cyclic quadrilateral, then every circle in the plane can be 
expressed uniquely as a barycentric combination of the four circles B C D, C D A, DAB , ABC. In other words, 
the set of circles on the plane is isomorphic to a subset of projective three-space P 3 . This idea of giving things 
other than points coordinates is not a new one; Pliicker created a geometry based on the four-dimensional space of 
lines in IR 3 . 
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Solutions 


1. By using V = - Ah, where A is the base area and h is the perpendicular height (i.e. distance from the origin 


to the reference plane), we have [P B C] = j[R P B C] = det 


(x y z\ 

0 1 0 
v 0 0 1 


= —. For the point A, x = 1 so 


this area is equal to . Hence, [P B C] = x [ABC]. 

2. If P lies on the line B C, the area x is obviously zero (as X B C is a straight line). By symmetry, we obtain 
the equations of the other two lines. 

3. For the centroid, we have [G BC\ = [GCA] = [GAB], When normalised so these areas sum to unity, they 
must each equal ^. 


4. This is the same argument as in the first question, but with the tetrahedra S P QR and AB C R. 

5. We can scale the coordinates of P such that x\ = 1. As the line passes through (1, —, —) and (1,0, 0), it 

V *1 X\ ) 

must obviously be the equation y\ z = z i }’. Hence, the intersection point has unnormalised coordinates 
(0, yu Zi). 


6 . Assume P exists and P=(x\, y \, zi). By the previous result, we have —- = —. The cyclic product is 


LC 


= 1. For the converse result, we know that there must exist precisely one point L on 


B L CM AN _ My\Zt 
LC Mt N~B Xl yi Zl 

B C such that AL,BM and C N are concurrent, and it must thus be the case where —' • = 1. 

LC MA NB 


7. This process effectively ‘flips’ each fraction in Ceva’s theorem, so the product remains equal to unity and 
thus the cevians are concurrent. 


8 . By the sine rule, we have smLLAB = — anc j smLCAL _ Dividing one by the other results in 

J sin B LA sin C LA £> J 

Sm L r'l B j = • Substituting cyclic permutations of this into the Ceva equation yields 

sin l— Cs A 

sin L LA B sin L M B C sin L N C A _ ^ 
sin L C A L sin L AB M sin L B C N 

9. This process effectively ‘flips’ each fraction in trigonometric Ceva’s theorem, so the product remains equal 
to unity and thus the Cevians are concurrent. 

10. The incentre and excentres must be their own isogonal conjugates, thus have unnormalised areal 
coordinates (±<r, ±b, ±c). The incentre is the only one with a symmetrical expression, (a, b, c). 

11. Using the identity tan A tan B tan C = tan A + tan B + tan C, we can divide each term in the expression 
(tan A, tan B, tan Cj by tan A tan B tan C to obtain (cot B cot C, cot C cot A, cot A cot B). 

12. BOA is isosceles, so LO AB = ^ - C. As AH is perpendicular to B C, we have LC AO = * - C. By 
symmetry, we are done. 

13. In Cartesian coordinates, let mass m\ be placed at Ai = (x\, y{), et cetera, and let = 1. Let P = ( u , v) be 
the barycentre, and Q = (x, y) be a variable point. By Pythagoras’ theorem, we have 
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A\ Q 2 = (x - xfj 2 + ( y - Vi) 2 = x 2 + y 2 - 2 x\x - 2 yi y + c\, where cq is a constant term that doesn’t really 
matter. Repeat for all points in this manner, and calculate the weighted sum. The weighted mean square 
distance is given by IVI(S, Q) = x 2 + y 2 - 2 u x - 2 v y + c, where c is another unimportant constant. But this 
is just (x - u ) 2 + (y - v) 2 + k, for some constant k, or P Q 2 + k. Substituting P = Q gives k = M(S, P). 
(Normally, one would not use Cartesian coordinates to solve a problem. However, in RMM 2011, l was 
under the influence of alcohol, so actually successfully performed this derivation.) 

14 . Invoking the Huygens-Steiner theorem once again, we obtain 

O P 2 = xR 2 + y R 2 + zR 2 - (xA P 2 + y B P 2 + zC P 2 ) = R 2 - ( xAP 2 + y B P 2 + zCP 2 ). 

15 . G has normalised areals ^^), giving us O G 2 = R 1 - ^ (AG 2 + BG 2 + C G 2 ). If D is the midpoint of 
B C, we obtain AD 2 = \ b 2 + \ c 2 - a 2 from Stewart’s theorem. Multiplying by ^ results in 

AG 2 = y ) b 2 + y ) c 2 - ^ a 2 , hence the cyclic sum A G 2 + B G 2 + C G 2 = ^ [a 2 + b 2 + c 2 ). Substituting this into 
the expression for O G 2 yields the desired formula. 

16 . 1 has normalised areals (——, ——, —-—), giving us O I 2 = R 2 --— (a A I 2 + b B l 2 + c C I 2 ). 

' a+b+c a+b+c a+b+c' a+b+c v > 

Applying Pythagoras’ theorem yields A l 2 = r 2 + (s - a) 2 = r 2 + s 2 - 2a s + a 2 . Hence, 
a A I 2 + b B I 2 + c C I 2 = (a + b + c) r 2 + (a + b + c) s 2 - 2 [a 2 + b 2 + c 2 ) s + (a 3 + b 2 + c 3 ). We can then 
convert this into an expression in terms of R, r and s, namely 

2ST 2 + 2s 3 +4s(r 2 +4Rr-s 2 ) + 2s(s 2 -3r 2 -6Rr) = 4Rrs. Hence, 0/ 2 =R 2 - 1 |^ =R 2 -2Rr. 


17 . The sum of squared distances from each point to the other points is constant. Hence, using the Huygens- 
Steiner theorem, all points must be concyclic. By considering the closest pairs of points, the points must be 
the vertices of a regular polygon or truncated regular polygon. 


18 . Apply an affine transformation to make D the orthocentre oiABC. Then, those nine points lie on a conic 
(the nine-point circle), and T is the barycentre of ABC D. Reversing the affine transformation results in a 
conic passing through those nine points; the centre of the conic is the barycentre of ABC D. However, a 
parabola has no centre, so the conic cannot possibly be a parabola. 


19 . D has normalised areals (0, -, —). Using the Huygens-Steiner theorem, we have 
AD 2 = -AB 2 + — AC 2 - -DB 2 - — D C 2 , 

a a a a 

us the theorem, man + dad = bmb + cnc. 


AD 2 = - AB 2 + — AC 2 - - D B 2 - — D C 2 , or d 2 = — + — _ tlUl _ AJE Multiplying through by a gives 


fO 

1 


20 . By considering the circumcentre and three vertices, det 


1 1 1 
0 R 2 R 2 


1 R 2 0 a 2 


1 R 2 


0 


1 R 2 b 2 c 2 

0 


1 

R 2 

b 2 
c 2 

0 


= 0. We now subtract the last 


row from the second, third and fourth rows to obtain det 


1 

-R 2 

0 

0 

R 2 


use the recursive formula to reduce this to the 4x4 determinant det 


Subtract the fourth column from the other three columns, giving the equivalent equation 


1 

1 

1 




,2 -b 2 

R 2 -c 2 

R 2 




-b 2 

a 2 -c 2 

b 2 

= 

0. We can now 

2 — b 2 

-c 2 

c 2 




b 2 

c 2 

0 . 




1 

1 

1 


1 j 


-R 1 

R 2 - b 2 

R 1 - 

c 2 

R 2 


0 

-b 2 

a 2 - 

c 2 

b 2 

= 0. 

0 

a 2 -b 2 

-c : 


c 2 , 
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det 


a 


.2 b 2 


= 0. We can then apply the recursive formula to give the 3x3 


= 0. It is now convenient to use the Rule of Sarrus to 


0 0 0 I 

-2 R 2 - b 2 

-b 2 -2 b 2 

-c 2 a 2 -b 2 - c 2 
' 2 R 2 

determinant det b 2 
.2 

evaluate this directly, resulting in the equation 

8 R 2 b 2 c 2 + 2 b 2 c 2 (b 2 + c 2 - a 2 ) = 2 R 2 (b 2 + c 2 - a 2 )’ + 2 b 2 c 4 + 2 c 2 b A . Dividing throughout by two and 
rearranging gives R 2 ((2 b c) 2 - (b 2 + c 2 - a 2 )“) = a 2 b 2 c 2 . Applying the difference of two squares to the left 
hand side yields R 2 [a 2 - (b - c) 2 ) ((b + c ) 2 - a 2 ) = a 2 b 2 c 2 . Another couple of applications enables further 
factorisation to R 2 (a + b + c)(a + b - c)(a - b + c) (-a + b + c) = a 2 b 2 c 2 . 


—c 

2 -b 2 -c 
■ 2 c 2 

b 2 c 2 

2 b 2 b 2 + c 2 - a 
<r b 2 + c 2 - a 2 2 c 2 


21. Represent A, B and C with complex numbers Z, m and n, respectively, where 11* + m m* + n n* = R 2 . Then 
we have p = xi l + y\ m + zi n and q = X 2 1 + yi m + Z 2 n. Subtracting them results in p — q = ul + vm + wn. 
Multiplying by its complex conjugate gives the squared modulus 

(p - q) (p* - q*) = (u 2 + v 2 + w 2 ) R 2 + uvlm* + uvml* + vwmn*+vwnm*+wunl*+wul n*. As 

u + v + w = 0, u 2 + v 2 + vi' 2 = -(2 nv + 2vw + 2w«). Applying this substitution gives 

(p - q) (p* - q*) = H (m v(/ m* + m l* -2 R 2 )) = (u v(l - m) (Z* - m*)). The final expression is equal to 


eye 

-u v c 2 - v w a 2 - w u b 2 , as required. 


eye 


r x(x + y + z) 10 0 
y(x + y + z) 0 10 
z(x + y + z) 0 0 1 

K a 2 yz + b 2 zx + c 2 xy 0 0 0 
neatly multiplies out to give -a 2 yz-b 2 zx-c 2 xy = Power( P. ABC). 


22. Using Goucher’s theorem, we have det 


[ABC] 

[ABC] 


Powerf / J , ABC). This 


23. O I~ - R~ is minus the power of I with respect to the circumcircle of ABC, so is equal to 


a 2 y z+b 2 Zx+c 2 xy a 2 b c+b 2 c a+c 2 ab abc 


[x+y+zf 


0 a+b+c) 2 


a+b+c 


. We can express this in terms of R, r and ,v, obtaining 


4 fir 


= 2Rr. 


24. 


a 2 yz+b 2 zx+c 2 xy a 2 b c-b 2 ca-c 2 a b 


(*+v+z) 2 


(—a+b+c) 2 


2 Rrs 


(- a+b+c ) 


= -2 Rr A . 


25. Using Euler’s formula on each tritangential circle, this is equal to 4 R 2 + 2 R(r A + r B + r c - r). It was proved 
in an earlier exercise that r A + r B + r c - r = 4 R. 

26. The orthocentre has unnormalised areal coordinates (tan A, tan B , tan C). Hence, minus the power of H with 

. . . . a 2 y z+b 2 z x+c 2 xy a 2 tan B tan C+b 2 tan C tan A+c 2 tan A tan B a 2 cot A+b 2 cot B+c 2 cot C 

respect to the circumcircle is 


(x+y+zr 


tan 2 A tan 2 B tan 2 C 


tan A tan B tan C 


Remembering that cot A = b ^ B ^ , the numerator is equal to “ b + b —~—= 4[A B C], (The last 

step is from squaring Heron’s formula.) This results in R 2 - O H 2 = 4[A B C] cot A cot B cot C. For the 
second part of the problem, we use [AB C] = -2 R 2 sin A sin B sin C. Substituting this into the previous 

formula gives us 8 R 2 cos A cos B cos C, as required. 


27. As the question is a homogeneous function in areas, we can apply an affine transformation and consider the 
case of the equilateral triangle. In areal coordinates, the inequality becomes y[~c > \Ta + V7T. The set of 
points for which (\[~a + ypb + y[~c j [~a + x[~b + y[c ~j (xTa - x[~b + {xTa + y[~b - Vc~) is non¬ 
negative is the interior of the conic a 2 + b 2 + c 2 < 2 a b + 2 b c + 2 c a. As the conic passes through , 0), 
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16 ’ 6 ’ 3 


, -) and cyclic permutations thereof, it must be the incircle. Using the formula for the area of a circle, 


VT 


we have that the probability that it lands within the conic is ——. Hence, the probability that it lands 
outside the conic must be J n . However, this could occur if any of yfc > yj~a + yfa~ > yfb~ + Vc” 


or 


yf~b > V7 + \Ta are true. By symmetry, we actually want one-third of this, namely 


9-nVT 


27 



28 . Invert about B. The three circles are mapped to lines through A ' (the inverse of A), and the line / remains 
invariant. The cross-ratio (oo P 2 P \', P 2 ') is independent of /, as we can view A' as a projector. This is the 
same as the original cross-ratio (oo, P 2 \ P\, P 2 ), so that must also be independent of 1. As one of the points 
is infinity, the simple ratio P\ P 2 : P 2 P 2 also remains constant. 

29 . By Huygens-Steiner, £(m ; fVI(r,-, P)) = YA m i M(T,, G)) + P G 2 . Subtracting 2 Yi[ m i r 2 ) from each side gives 
us Yfanj Power! P, T,)) = X( m i Power(G, T,)) + P G 2 , or Power!F) = Power(G, F) + P G 2 , so G must be 
the centre of F. 

30. The three circles all pass through the orthocentre H, so H must lie on F. 0,\ Ob Oc is a dilated copy of the 
medial triangle LM N, which has orthocentre O, therefore we can deduce that O is the orthocentre of 

Oa Ob Oc- As H is the circumcentre of O r \ Ob Oc, the centroid of O r \ Ob Oc (and thus centre of F) must be 
the point Q on the Euler line of ABC halfway between G and H. As F also passes through H, it must 
necessarily be the Euler-Apollonius lollipop. 

31. This is the weighted barycentre of the Euler-Apollonius lollipop and the circumcircle, where the former has 

3 1 

mass - and the latter has mass -. Hence, F must have centre T, i.e. the centre of the nine-point circle. Let X 

be the radius of the barycentric circle F. Using the radius formula, we have 

X 2 = - (GQ 2 -T <2 2 ) + “ (R 1 ~ O 7’ 2 ). Let p = T Q. By the basic ratios along the Euler line, this is 

^ (4 p 2 - p 2 ) + ^ (7? 2 - 9 p 2 ) = ^ R 2 . Hence, X = ^ R and thus F is the nine-point circle. 







Mathematical Olympiad Dark Arts | 133 


The Riemann sphere 

We can augment the complex plane by adding an extra point, oo, to the plane. This produces the complex projec¬ 
tive line, CP 1 . It can be considered to be a projective space represented by a pair of complex coordinates, ( x , y), 
where scalar (complex) multiples are considered equivalent. Hence, for all points except oo, we can normalise the 

coordinates as , 1 j. We identify these points with points on the ordinary complex plane by letting z = -. If 
y = 0, this results in the point at infinity, oo. We consider a line to be a generalised circle passing through co. 

Two-dimensional inversion 


Inversion is essentially a reflection in a circle. Points outside the circle are interchanged with points inside the 
circle in an involution of the plane. 

■ For a circle T with centre 0, we define the inverse point of P to be the point P' on the line 0 P (at the same side as P 
from 0) such that 0 P-0 P' = R 2 3 4 , where R is the radius of T. [Definition of inversion] 

/?2 

If O is the origin, the inverse point can be found by the simple transformation z -» —. 



1. Show that O and oo are inverse points with respect to any circle centred on O. 

2. Demonstrate that (A, B\ P, Pj is a harmonic range. 

3. If P and Q invert to P' and Q\ show that P, P', Q and Q' are concyclic. 

4. Show that P' Q' = pgg 0 ■ [Inversion distance formula] 
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B C 


5. Let ABC DE be a regular pentagon with side length 1. Let the diagonals B D and C E intersect at A ', and 
define B\ C", D' and E' similarly. Show that: 
mBE = cp- 


\ D' C' — ■ 


»BC'= 1; 

where f is the positive root of the equation x 2 - x - 1 = 0. (f = 1+ ^~ ^ 1.618034.) 


Mobius transformations 


We define a Mobius transformation to be a transformation of the form z -> where a, b, c, d e C. It is also 

necessary to include the condition that ad - be t 0, to remove degenerate singular non-invertible cases. We can 
assume without loss of generality, therefore, that ab - cd- 1. 

6. Show that the composition of any two Mobius transformations is another Mobius transformation. 

7. Prove that the composition of any inversion followed by any reflection is a Mobius transformation. 


8 . 


Show that, for four points w, x, y, z e C, the value of 


(' w-x)(y-z ) 
(x-y) ( z-w ) 


remains invariant when a Mobius 


transformation is applied. 


Indeed, this follows naturally from the fact that Mobius transformations are projective transformations of the 
complex projective line. Hence, it is possible to find a unique Mobius transformation mapping any three points to 
any other three points. 


9. Demonstrate that generalised circles remain as generalised circles under any Mobius transformation, and 
thus under inversion. 


Like all non-trivial rational functions of z, Mobius transformations are conformal maps, which means angles 
between curves are (in general) preserved. Inversion reverses the direction of directed angles, but preserves the 
magnitude. This property can be derived from the fact that generalised circles remain as generalised circles. 
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10. Suppose Pi P 2 P 3 P 4 is a cyclic quadrilateral. The circle T„ passes through P„ and Pn+U with subscripts 
considered modulo 4. Circles T„ and T „ +1 intersect again at Q n+ \. Prove that (Q\ Qi Q 3 Q 4 are either 
concyclic or collinear. [Miquel’s theorem] 

11. Let ABC be a triangle with a right-angle at C. Let C N he an altitude. A circle T is tangent to line segments 
B N, C N and the circumcircle of A B C. If D is where F touches B N, prove that C D bisects angle LBC N. 

[NST2 2011, Question 3] 


Ivan’s 25 circles 


Consider a cyclic quadrilateral ABC D with circumcentre O. Let A B intersect C D at P. Similarly, we define 
Q = B C f| DA and R = A C f| BD. 

12. Prove that P is the pole of Q R, and hence that O is the orthocentre of P QR. [Broeard’s theorem] 

Denote the feet of the altitudes from P, Q and R with X, Y and Z, respectively. Then, we obtain six cyclic quadri¬ 
laterals from the fact that O is the orthocentre of P QR. 

13. Prove that inversion about the circumcircle of ABC D interchanges O with 00 , P with X, Q with Y and R 
with Z. 

14. Hence prove that O AB X are concyclic. 

By symmetry, this gives us six cyclic quadrilaterals, increasing the total to twelve (excluding ABCD ). There are 
still another twelve circles on the diagram to be found. 

15. Show also that B C X Rare concyclic. 

By symmetry, this increases the total of cyclic quadrilaterals to 24 (excluding ABCD). These circles were 
discovered by Ivan Guo. 
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As well as inverting about 0, one can also invert about P, Q or R to permute the vertices. 


Harmonic quadrilaterals 


16. Let ABC D be a cyclic quadrilateral. Choose a point P on the circumcircle of ABC D and a line A outside 
the circle. The line PA meets A at A'; points B', C’ and D' are defined similarly. Show that the cross-ratio 
(A', C / ; B'. D') does not depend on the locations of P and A. 



A 

If (A', C'\ B', O') = -1, then the cyclic quadrilateral ABCD is known as a harmonic quadrilateral. Harmonic 
quadrilaterals have many nice properties: 

17. Let ABC D be a convex cyclic quadrilateral. Let the tangents to the circumcircle at A and C meet at £; let 
the tangents to the circumcircle at B and D meet at F. Let the diagonals A C and B D intersect at P. Then 
show that the following properties are equivalent: 
m AB C D is a harmonic quadrilateral; 

m AB ■ CD = B C ■ D A = - AC ■ BD\ 

2 

■ E, B and D are collinear; 

■ B, D and K are collinear, where K is the symmedian point of A BC\ 

■ (P, E- B, D) = - 1. 
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18. If a quadrilateral ABCD is represented by complex numbers a, b, c and d in the Argand plane, show that it 
is harmonic if and only if (a -b)(c- d) + (b - c) (d - a) = 0. 

19. Deduce that harmonic quadrilaterals/ranges remain harmonic after inversion. 

The collinearity of E, B, K and D gives us an elegant construction of the symmedian point: let the tangents to the 
circumcircle at B and C intersect at A', and define B' and C' similarly. The symmedian point is then the intersec¬ 
tion of A A', B B' and C C'. 



Generalised spheres 


We started by defining Mobius transformations and inversion in the environment of complex numbers. However, 
complex numbers are restricted to two (real) dimensions, so cannot be used to generalise the ideas to n-dimen- 
sional space. Instead, we will need to consider this from a more Euclidean perspective. 

When discussing objects in n-dimensional space, we use the following conventions: 

• A n-ball comprises all points in IR" within a distance of R from a point O. 

• The surface of a «-ball is a (n - \)-sphere. 

• The set of points in R" obeying a single linear equation is a (n - 1 )-plane. 

• A generalised (n - \)-sphere can be a ( n - l)-sphere or a (n - l)-plane. 

20. Show that the intersection of two generalised (n - l)-spheres in R" is either empty, a single point, or a 
generalised (n - 2)-sphere lying in a (n - l)-plane of R". [Intersection of generalised spheres] 

21. Let T be a (n - l)-sphere and P a point in R". Lines l\ and I 2 pass through P. l\ intersects T at A and B\ I 2 
intersects T at C and D. Prove that P AP B-P CP D. [n -dimensional intersecting chords theorem] 

22. If a point P e R" has equal power with respect to non-concentric (« - l)-spheres T 1 and T 2 , then it lies on 
their radical axis. Show that the radical axis of two identical (n - l)-spheres is a (« - l)-plane. 

This forms the basis of an ingenious proof by Geza Kos that the radical axis of any two (non-concentric) circles 
on the plane A is a line. Lirstly, erect two spheres of equal radius, T 1 and T 2 , on the circles. 
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The radical axis of the two circles is the intersection of the radical plane of Tj and T 2 with the plane A. This 
construction clearly generalises to two non-concentric (n - l)-spheres in IR", by embedding the situation into R" +I 
with equiradial n -spheres. 


n-dimensional inversion 


We are now in a suitable position to define n -dimensional inversion and investigate its properties. 

■ Let T be a (n - l)-sphere in IR" with centre O and radius R. For any point P e R", the inverse point P' is defined to lie 
on the line O P at the same side of O such that O P O P' = R 2 . [Definition of n-dimensional inversion] 

23. Prove that generalised (n - l)-spheres map to generalised (n - l)-spheres under inversion. 

24. Draw a generalised (« - 2)-sphere on the surface of some generalised (n - l)-sphere in IR". Prove that, after 
inversion, this will remain a generalised (n - 2)-sphere. [Backward compatibility of inversion] 

The oblique cones in the diagram below intersect both the plane and the sphere in circular cross-sections, as the 
sphere and plane are inverses with respect to P. This idea of projecting the sphere onto a plane from a point on the 
sphere is known as stereographic projection. 



The diagram above enables us to easily define Mobius transformations. Let P be an arbitrary point outside the 
plane A. An arbitrary Mobius transformation of the plane A is the composition of: 

• A two-dimensional translation and/or homothety of the plane A; 

• An inversion about the unit sphere centred on P, transforming A into a sphere LI (the Riemann sphere); 

• A rotation of the sphere Li about its centre (P does not move with the sphere); 
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• An inversion about the unit sphere centred on P, transforming fl back into A. 

There are six degrees of freedom in this transformation, so they must represent all Mobius transformations, and 
nothing else. This definition of a Mobius transformation clearly generalises to R" (with - (n + 1) (« + 2) degrees 
of freedom), whereas the complex number definition does not. 

25. Suppose two smooth curves are drawn in R", which intersect at a point. Prove that the angle of intersection 
is preserved (or, more accurately, reversed) after inversion. [Anti-conformal map] 


Kaleidoscopes 


You may have encountered kaleidoscopes, where mirrors are used to form lots of repeated copies of a 
‘fundamental region’. For example, we can generate patterns with the same symmetries as the hexagonal tiling by 
using a triangle of mirrors with internal angles of ^ 7 r, ^ it and | it. Mathematicians regard this as a group of 

symmetries, generated by the three reflections. If we call the reflections a, P, y. then we have the relations 
a 2 = p 2 = y 2 = /, where I is the identity element; a reflection is its own inverse. Our three rotations, a /i, py and 
ya, together generate half of the symmetry group, namely the group of direct congruences. We also have 
(a P) 2 = (/?y) 3 = (y a) 6 = /, as applying a rotation of 2 it is equivalent to the identity. 





/ i 

> / 

/• 



\i 

• \ 


X 

m, 

• / 

/ i 

> / 


X 

\i 


In Euclidean geometry, the interior angles of a triangle necessarily sum to it. However, if we allow circular arcs 
instead of straight lines, this condition can be relaxed. For example, we can have a curvilinear triangle with 
interior angles of ^ it, ^ it, ^ it: 



Reflection in the circular arc is simply inversion, and reflections in the straight lines are ordinary reflections. In 
this manner, we generate a spherical reflection group, namely the symmetry group of a dodecahedron. The 
compositions aP, Py and ya are no longer necessarily rotations, but are instead Mobius transformations. For 
this group, (a P) 2 = (P y ) 3 = (y a) 5 . 
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The above picture of icosahedral symmetry may look distorted, not least because we have flattened the Riemann 
sphere into a plane by stereographic projection. Finally, we can produce hyperbolic tilings by ensuring the interior 
angles of the fundamental triangle sum to less than n (by using concave arcs); this visualisation of the hyperbolic 
plane is known as the Poincare disc model. 



We can define ‘distances’ and ‘angles’ in hyperbolic geometry. As hyperbolic space must be invariant under any 
Mobius transformation mapping the unit circle to itself, distances and angles must also be preserved. We already 
know that complex (or cyclic) cross-ratio is invariant under Mobius transformations. The (directed) hyperbolic 
distance between two points, A and B, is given by the logarithm of the cyclic cross-ratio (A, B; X, Y), where X 
and Y are the two intersections of the circle ABB' A' (where ' indicates inversion in the unit circle) with the unit 
circle. Note that ABB' A' is orthogonal to the unit circle. 
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■ The hyperbolic distance between two points is given by A B = | ln(/i. II: X, Y) |. [Definition of hyperbolic distance] 

A hyperbolic line, i.e. the shortest path between two points A and B, is the arc AB of the circle AB B' A'. Hence, 
the triangle inequality applies: for three points A, B, C,AB + BC>AC, with equality if and only if they are 
collinear and in the correct order. As angles are preserved under Mobius transformations, the angles in the 
Poincare disc model are the same as those in the hyperbolic plane. 

■ The hyperbolic angle between two hyperbolic lines is identical to the ordinary angle between the corresponding 
circular arcs on the Poincare disc model. [Definition of hyperbolic angle] 

With these principles, it is possible to explore the rich world of Bolyai-Lobachevskian geometry. Four of Euclid’s 
postulates (basic assumptions from which all of geometry can be derived) hold in hyperbolic geometry, whereas 
the fifth postulate does not. The fifth postulate is equivalent to the interior angles of a triangle summing to n. 


Steiner’s porism and Soddy’s hexlet 


We have already encountered Poncelet’s porism, which states that if there is one «-gon inscribed in one ellipse 
and circumscribed about another, then there are infinitely many. Another example of a porism (a term which 
people struggle to define) is Steiner’s porism, again named after Jakob Steiner. 
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26. Let Ti and 1L be two circles, such that Ti is contained within r 2 . A set of/? > 3 circles {Ci, C 2 , C„[ is 

known as a Steiner chain of length n if each C, is tangent externally to C !+ i, C,-_i (where subscripts are 
considered modulo n) and Y 1 , and is tangent internally to IT. Show that if there exists one Steiner chain of 
length n for two given circles, then there exist infinitely many. [Steiner’s porism] 

There is an analogous three-dimensional porism, which is less general but more interesting, called Soddy’s hexlet. 
This also appeared as a Japanese Sangaku problem. 

27. Let Ti, Y 2 and T^ be three mutually tangent spheres. A set of n > 3 spheres {Si, S 2 , ■ ■ ■, S„ j is known as a 
Soddy chain of length n if each S, is tangent externally to S I+ i, S,_ 1 (where subscripts are considered modulo 
n), Ti, Y 2 and T 3 . Show that infinitely many Soddy chains exist of length 6, and no Soddy chains exist for 

n =t 6. [Soddy’s hexlet] 



The six green spheres {Si, ..., S (] [ are tangent to a quartic doughnut-shaped surface known as a Dupin cyelide, 
which is an inverted torus. 
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Solutions 


1. Assume, without loss of generality, that O is the origin of the complex plane. Then, we have 0 ->— = ^ = oo. 


A P BP' 

2. We can compute the linear cross-ratio — • — = 

PB PA 


(R+p) 


(f*) 


-Rp 


( R 2 \ R 3 

-7-*) -7 +r p 


= -l. 


3. This is a trivial corollary of the intersecting chords theorem. 

4. P' Q' O is similar to QP O, so we have . By using the identity P O P' O = R 2 , we obtain the 

desired formula. 


5. Let the length of the diagonals be denoted by x. By Ptolemy’s theorem on B C D E, we have 1 + x = x 2 , so 
x = cp. Note that D' and C are inverse points with respect to the unit circle centred on A. Hence, we obtain 

BC 1 BD <p , CD 1 


BD' = 


ABAC 


= i;BC' = 

<p 


= - = 1 ; 
A BAD 0 


D'C' = 


ACAD 


r 


6 . Mobius transformations are projective transformations of the complex projective line, thus the composition 
is simply the product of their matrices. 

7. Translations composed with rotations and dilations are Mobius transformations, as they have the form 

z -> a z + b. Hence, we need only consider the case where the inversion is in the unit circle, and thus has the 
form z -* 4 ■ Composing this with the general form of an indirect similarity, z -» a z* + b, results in the 
transformation z -» - + b, which is clearly a Mobius transformation. 

8 . Mobius transformations can be regarded as a projective transformations of the complex projective line, so 
must necessarily preserve complex cross-ratio. 

9. The condition for four points to be concyclic or collinear is that the complex cross-ratio, , is real. 

As it is preserved under Mobius transformations, so must the property that four points are concyclic or 
collinear. Obviously, a reflection also preserves this property, so inversions (compositions of reflections and 
Mobius transformations) must also do so. 

10. Inverting about Pi reduces the problem to the pivot theorem. 

11. By Thales’ theorem, AB is a diameter of the circumcircle, so is orthogonal to it. Invert about C. A' B' and 
N' 00 ’ are diameters of the same circle (the inverse of line A B), so A' N' B' 00 ' forms a rectangle. The 
inverse of T is tangent to the diagonals and circumcircle of the rectangle A' N' B' 00 so lies on one of the 
lines of symmetry of the rectangle. The point D' must thus be the midpoint of the minor arc B' N', as D lies 
between B and N in the original diagram. As arcs B' C' and C" N' are congruent, they must subtend equal 
angles at 00 '. As angles are preserved under inversion, we are finished. 

12. Let Q R intersect A B and C D at E and F, respectively. From applying a projective transformation to 
convert AB C D into a square, we can deduce that ( P , E\ A, B) and (P, F: C, D) are harmonic ranges. This is 
sufficient for Q R to be the polar of P. Hence, O P is perpendicular to Q R. By symmetry, O must be the 
orthocentre of P Q R. 
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13. O is the centre of the circle of inversion, so is exchanged with the point at infinity. X is the intersection of 
O P with the polar of P, so they must be inverse points. By symmetry, we obtain the other two pairs of 
inverse points. 

14. Inverting about the circle ABCD maps O and X to oo and P, respectively. ABP is a straight line and thus 
passes through oo, so the original four points were concyclic. 

15. Note that lines BC,OY and X R concur at Q. Applying the converse of the radical axis theorem to circles 
OYBC and ROXY, we obtain the concyclicity of X RBC. 

16. It obviously does not depend on A, as we can view P as a projector, and moving A is simply applying one¬ 
dimensional projective transformations to the line A' B' C' D'. Hence, the cross-ratio is dependent only upon 
the angles A PB, B P C and CPD. Due to basic circle theorems, these are invariant as P moves on the 
circumcircle of ABC D. 

17. Inversion about P and considering similar triangles derives the first of these results. To show that the 
collinearity of E, B and D implies that the quadrilateral is harmonic, use the intersecting chords theorem to 
show that A E D is similar to BE A, and that DEC is similar to C E B. We then have 

DA / AB = DE / AE = D E / C E = C D / B C, thus the products of opposite sides are equal. The 
converse follows trivially. Showing that P is in harmonic range with E, B and D stems from the fact that P 
lies on A C, which is the polar of E. To demonstrate that K lies on B D, it is sufficient to show that P is the 
foot of the symmedian from B to AC. This is equivalent to the statement that AP/AB 2 = CP/C B 2 . To 
prove this, we can exploit similar triangles to show that 
AP/AB = DP/CD = (P A-P C/A B)/(B C-D A/AB), whence it follows that 

AP/(AB 2 ] = (PA-PC)/(PB-BC-DA) = (PAPC)/(PBBA-DC) = CP/(CB 2 ). This shows that/ 5 has 
the required areal coordinates to be the foot of the symmedian. 

18. Consider when the complex cross-ratio (a - b)(c - d)/(b - c)(d - a) = - 1. From here, we use polar 
coordinates, resulting in arg(a -b)/ arg(c - b) = arg(a - d)/ arg(c - d) and 

\a-b\ | c - d | = | b - c \ \d - a \. These are equivalent to lABC = LADC and AB ■ C D = B C D A, 
respectively. Clearly, the first of these is the ‘angles in the same segment’ criterion for concyclicity, and the 
latter is condition for a cyclic quadrilateral to be harmonic. 

19. A composition of an inversion and a reflection is a Mobius transformation, i.e. a map of the form 

z -> (a z + b)/ (c z + d). This can be regarded as a projective transformation of the complex projective line, 
so must necessarily preserve complex cross-ratio. 

20. For two (n - l)-planes, the intersection is either empty (if they are parallel) or the (n - 2)-plane formed by 
solving their algebraic equations simultaneously. For two (n - l)-spheres, consider all 2-planes passing 
through both centres. If the circles are disjoint, so are the original spheres. If the circles are tangent, the 
original spheres share a single point. If the circles intersect in two points, then let the radical axis meet the 
line of centres at P. Clearly, all intersection points of the original two spheres must lie on the (n - l)-plane 
through P perpendicular to the line of centres, and must all be equidistant from P. For the case of one 

(n - l)-sphere and one (n - l)-plane, we can reflect the sphere in the plane and reduce it to the previous case. 

21. Consider the plane containing Ii and U, and apply the two-dimensional intersecting chords theorem to this 
configuration. 

22. The power of a point is equal to O P 2 - R 2 . When the spheres are of equal radii, the equation for the radical 
plane becomes O P\ 2 = O P 2 , which is the locus of all points equidistant from the centres of the two 
spheres. This must be the plane of reflective symmetry of the configuration. 
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23. Draw the line Z through 0 orthogonal to the generalised (n - l)-sphere, intersecting it at A and S (one of 
which may be at infinity if the generalised sphere is a plane). Let the inverse points be A ' and S', which also 
lie on Z. Every plane containing Z must intersect the generalised (n - l)-sphere in a generalised circle of 
diameter A S; this inverts to a generalised circle of diameter A’ B' using ordinary two-dimensional 
inversion. The union of all such generalised circles is the generalised (n - l)-sphere of diameter A' B'. 

24. Consider the generalised (« - 2)-sphere to be the intersection of two (« - l)-spheres. The result follows 
from the fact that the intersection is either empty (impossible), a single point (impossible) or a generalised 
(n - 2)-sphere. 

25. We can reduce this to the 3-dimensional problem, as we need only consider the 3-plane containing the 
tangents to the two curves, l\ and L, and the centre of inversion, O. Assume they intersect at P with an angle 
a. After inversion, the plane l\ la becomes a generalised 2-sphere, and generalised circles /1 and I 2 intersect 
at P' with an angle /3. If the generalised 2-sphere is a sphere, we invert about the point diametrically 
opposite to S', preserving P' whilst mapping l\ and h back to lines. Clearly, the lines now intersect at an 
angle -p. This orientation-preserving generalised-circle-preserving map from the Riemann sphere to itself 
must necessarily be a Mobius transformation, and thus preserve angles. Hence, a = -ft, and we are done. 

26. Let the line of centres Z intersect Tj at B and C, and T 2 at A and D, such that ABC D are in that order along 
Z. Note that the two circles intersect Z orthogonally. Let A' B' and C' D' be two unit lengths on the same line 
Z', and separate them such that (A', D'\ B'. C") = (A, l): B, C). Apply a Mobius transformation to map A to 
A', S to S' and C to C'. As the cross-ratios are equal, D must necessarily map to D'. The two circles are now 
both orthogonal to Z', so the centres both lie on this line. As A' D' and S' C' share a midpoint, the centres 
must coincide and the circles are concentric. We can now ‘rotate’ the Steiner chain within the annulus by an 
arbitrary angle, similar to a ball bearing. Applying the inverse Mobius transformation will restore T 1 and T 2 . 

27. Invert about the tangency point of T 1 and T 2 , resulting in two parallel planes sandwiching the sphere T 3 . All 
spheres tangent to T 1 and T 2 must have diameter equal to the separation of the planes. Hence, their centres 
must be coplanar with T 3 and have equal radius. Hence, we can consider this to be a two-dimensional 
problem of packing a closed loop of n circles of unit radius around a circle of unit radius. As the centres of 
T 3 , Si and S ,-+1 must form an equilateral triangle, this forces the angles to be - and thus there must be a 

regular hexagon of six spheres. There are infinitely many orientations in which this can be done. 
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Diophantine equations 

So far, we have considered solutions to equations over the real and complex numbers. This chapter instead 
focuses on solutions over the integers, natural and rational numbers. There is no algorithm for solving a generic 
Diophantine equation, which is why they can be very difficult to solve. In fact, many open problems such as the 
Riemann hypothesis can be embedded in questions about Turing machines, which can in turn be converted to 
hideously complicated Diophantine equations. Even proving the non-existence of positive integer solutions to the 
innocuous-looking equation x n + y" = z" (n > 3) (Fermat’s last theorem) occupied mathematicians for three 
centuries before finally being settled by Andrew Wiles. 

We will adopt a geometric approach to the problem, locating points in Z", fT and Q" lying on some particular 
curve. The simplest curve is a straight line (or plane, or hyperplane), corresponding to a linear Diophantine 
equation. Linear Diophantine equations are easy. Let’s consider the example 3 x + 5 y = 1. 



We can see that this has integer solutions, such as ( x , y) = (2, -1). In general, any linear Diophantine equation 
with coprime coefficients has infinitely many solutions in the integers, which can be found using the Chinese 
remainder theorem. If the coefficients are not coprime, such as 6 x - 4 y = 5, there may be (as in this case) no 
solutions. If there are no solutions, a simple proof exists using modular arithmetic. 

■ A point ( x , y) is rational if and only if both x and y are rational. 

1. Let T be a conic with rational coefficients, and let A be a rational point on T. If B is another point on T, 
show that B is rational if and only if AB has rational gradient. 

This theorem enables us to find all integer solutions to a 1 + b 2 3 = c 2 , known as Pythagorean triples as they 
correspond to right-angled triangles with integer side lengths such as the famous (3, 4, 5) triangle used as a set 
square by the Ancient Egyptians. We can apply the geometrical concept of stereographic projection. 



2. Show that all rational points on the unit circle can be obtained by inverting the rational line, as shown above. 

3. Find all rational solutions to x 2 + y 2 + z 2 = 1. 
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4. Hence find all integer solutions to a 2 + b 2 - c 2 , where gcdfa, h, c) = 1. [Elementary Pythagorean triples] 

5. Show that there exists an infinite set S of points, no three of which are collinear, such that the distance 
between any two points is rational. 

Vieta jumping 


Suppose we are attempting to determine whether or not a certain quadratic equation in two variables has integer 
solutions. This can be visualised as a conic section, which is either an ellipse, parabola or hyperbola. As an ellipse 
is finite, we need only check a few pairs of values for integer solutions. To deal with the parabola, we can change 
the coordinate system by applying an affine transformation to convert it into a simpler equation. For the hyper¬ 
bola, however, it is necessary to adopt a more sophisticated technique such as Vieta jumping. 

For example, consider the equation a 2 + b 2 = k{a b + 1), where A: is a fixed positive integer. 

6 . Find all integer solutions to the equation a 2 + b 2 = k(a b + 1) for the cases where k = I and k = 2. 

We will now consider the cases where k > 3. 

7. Show that there are no integer solutions to a 2 + b 2 = k(a b + 1) where k > 3 and one of a, b is negative and 
the other is positive. 

8 . Sketch the curve T described by the equation x 2 + y 2 = k(x y + 1), for k> 3. 



9. Suppose that P = ( a , b) is a positive integer solution, and draw a vertical line through P. Show that it meets 
T again at another non-negative integer point, Q = {a, c). Also, if b > a, then show that c <b. 

This is the principle behind Vieta jumping. We start with some (hypothetical) solution, then use it as the basis to 
construct a smaller solution until we reach a contradiction. Fermat employed this process of infinite descent to 
prove that there are no solutions in the positive integers to x 4 + y 4 = z 4 . Euler later refined the approach to apply 
to the equation x 3 + y 3 = z 3 . 
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10. If a 2 + b 2 = k(a b + 1) (for some k > 3) has a solution in the integers, then show that there is a solution 
where b = 0 . 


11. Let a and b be positive integers. Prove that if 


a 2 +b 2 
ab +1 


is a positive integer, then it is a perfect square. [IMO 


1988, Question 6] 


12. Let a and b be positive integers. Prove that if 


a 2 +b 2 +1 
ab 


is a positive integer, then it equals 3. 


Pell equations 


Vieta jumping is primarily useful for proving the non-existence of solutions to hyperbolic equations. What if, 
instead, we want to find an infinite family of solutions? The idea is to create new solutions from old by some form 
of recurrence relation. Firstly, consider equations of the form x 2 = n y 2 + 1, where n is not a perfect square. These 
are known as Pell equations , even though Pell had absolutely nothing to do with them. Basically, someone 
wanted to solve these equations, so told Euler that Pell was working on them. Consequently, Euler solved the 
equations, but attributed them to Pell. 

13. Consider the complex numbers z = a + b V —n and w = c + cl f -n ■ Prove that 
(a 2 + n h 2 ) (c 2 + n cl 2 ) = (ac - bd n) 2 + n(a d + be) 2 . [Brahmagupta’s identity] 

Brahmagupta’s identity can also be verified algebraically, so is true even when n is negative. This gives us the 
related identity (a 2 - n b 2 ) (c 2 - n cl 2 ) = (ac + bd n) 2 - n(acl + b c ) 2 . 

14. Show that we can generate infinitely many solutions (in positive integers) to x 2 = n y 2 + 1 (where xpn <$. N) 
from one known solution. 

15. Hence prove that there are infinitely many square triangular numbers. 

As x and y can be very large, we obtain very good rational approximations - =* V7T in this manner. These rational 

approximations can also be generated by analysing the continued fraction expansion of sfn", i.e. the unique 
expression of sfn of the following form: 

1 

x =-1 a i 

—I-1 a 2 

-+«3 

We write this as [cq; ai, « 3 , « 4 , ...]. The sequences of all quadratic irrationals (solutions to quadratic equations in 
integer coefficients) are eventually periodic, and the converse also holds. For example, the sequence 

[ 1 ; 1 , 1 , 1 , 1 , ...] corresponds to (f> = 1 ' , and truncating the sequence produces the approximations -^ ui , where 

2 F n 

F„ is the «th Fibonacci number. Every Pell equation has infinitely many solutions. This is not true in general of 
the negative Pell equation, x 2 = n y 2 - 1 (where sf~n f Rl); the negative Pell equation has solutions if the contin¬ 
ued fraction has an odd period length. 

The continued fraction for n is somewhat chaotic: [3; 7, 15, 1, 292, ...]. Truncating before the 292 yields the 

355 

approximation [3; 7, 15, 1] = [3; 7, 16] = ^, which agrees with the actual value of n to six decimal places, e has 
a very regular continued fraction expansion of [2; 1,2, 1, L 4, 1, 1, 6 , 1, L 8 , 1, ...]. 
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Sums of squares 

Brahmagupta’s identity, in the case where n= 1, provides the identity 
(a 2 + b 2 ) (c 2 + d 2 ) = (ac - b d ) 2 + (a d + b c) 2 as a special case. In other words, if S is the set of numbers that can 
be expressed as the sum of two squares (of integers, or, more generally, of rationals), then S is closed under 
multiplication. Using Hamiltonian quaternions, we can produce an analogous formula for four squares: 

■ ( a 2 + b 2 + c 2 + d 2 ) (e 2 + f 2 + g 2 + h 2 ) = 

(ae-bf-cg-dh) 2 + (af+be + ch - d g) 2 + (a g-bh + ce + d f) 2 + {ah + b g — c f + d e) 2 

[Euler’s four-square identity] 

Using even more bizarre eight-dimensional numbers called octonions, there is a similar identity for eight squares. 
However, there are no identities beyond this, as doubling the number of dimensions causes the numbers to lose 
their useful properties. Quaternions are non-commutative, i.e. xy + yx in general. Octonions are even worse, 
since they also lose associativity: x(y z.) t (x y) z. Beyond this, the numbers have no useful properties remaining, 
and the 2 "-square identity breaks down. 

It is interesting to see when an integer can be expressed as the sum of n squares of integers. Clearly, if N can be 
expressed as the sum of squares of n integers, then it can also be expressed as the sum of squares of n + 1 integers, 
as we can set one of those equal to zero. This gives a nested hierarchy: 

■ A positive integer N can be expressed as the sum of one square if and only if it is a perfect square, i.e. N = a 2 for 
some a e Rl. [Trivial one-square theorem] 

■ A positive integer N can be expressed as the sum of two squares if and only if it can be expressed as 

N = a 2 2 k (pi p 2 ... p n ), where each p t is a prime congruent to 1 (modulo 4) and a and k are non-negative integers. 

[Fermat’s Christmas theorem] 


■ A positive integer N can be expressed as the sum of three squares if and only if it is not of the form 4 k (8 m + 7). 

[Legendre’s three-square theorem] 


■ Any positive integer can be expressed as the sum of four squares. [Lagrange’s four-square theorem] 


16. Generalise each of the above theorems to determine when a rational number is expressible as the sum of 
one, two, three or four squares of rationals. 


Sam Cappleman-Lynes technique 


Consider the Diophantine equation x 2 + y 6 = z 7 ■ If we have a solution to the equation a 3 + //’ = c (which is 
trivial), then we can multiply all terms by c 6 to obtain a 3 c 6 + //’ c 6 = c 7 , which is a solution to the original 
equation. This enables us to create infinitely many distinct solutions in this manner. This idea, known as the Sam 
Cappleman-Lynes technique, is applicable in many problems. 

17. Show that there are infinitely many solutions to x 4 + y 6 = z 10 in the positive integers. 

More generally, if a, b, c are pairwise coprime, then x ak + y bk = z ck has infinitely many solutions in the positive 
integers when k = 1 or k = 2. A consequence of Fermat’s last theorem is that there are no solutions for k> 3. 

18. Let A be the set of all integers of the form a 2 + 13 b 2 , where a and b are integers and b is non-zero. Prove 
that there are infinitely many pairs of integers x, y such that x 13 + y 13 e A but x + y $A. [Mongolian TST] 
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19. Determine whether there exists a set S of 2012 positive integers such that the sum of elements in each 
subset of S is a non-trivial power of an integer. [IMO 1992 shortlist] 

A large proportion of the problems solved using the Sam Cappleman-Lynes technique reduce to this (very 
general) theorem, which is proved first by applying linear algebra to the simultaneous equations followed by a 
similar approach to the last question. 

■ Suppose we have a system of m equations in n variables (x\, xi, ..., x n ) of the following form, where m<n: 

■ r (i,D x i a ' + r (i,2) x 2 ai + ••• + r ( i in ) x n a " = 0; 

■ r ( 2 ,i) x \ a ' + r ( 2 ,2) x 2 ° 2 + ... + r (2 ,„) = 0 ; 


■ '•(„,!) xf' + r (m? 2 ) x 2 “ 2 + ... + r M x n a " = 0. 


Suppose that the following conditions are also true: 

■ All of the coefficients, rpj\, are rational (not necessarily all non-zero); 


i All of the exponents, a ; , are positive integers; 

'■(1,1) '(1,2) ■■■ r (l,m) 

r<2,l) >\2,2) ■■■ r {2,m) 


i det 


I'im. I) x (m,2) 




* 0 ; 


■ For all i < m and i < j < n , a,- is coprime with ay, 
m There is at least one solution in the positive real numbers. 

In that case, there are infinitely many solutions in the positive integers. [Generalised Sam Cappleman-Lynes 
theorem] 


The invertibility of the matrix enables us to apply elementary row operations to reduce it to a diagonal matrix in a 
process known as Gauss-Jordan manipulation. We then have m equations of the form 
x i a ‘ = < 7 (;,m+i) x m+ i flm+1 + <?(;,m+ 2 ) x m+ 2 “ m+2 + • ■ ■ + q(i,n) x n ( one equation for each 1 < i < m), and a real solution to 
the equations. This is a linear equation in x, a ‘ (for all 1 < i < n), so can be represented by a (n - »i)-dimensional 
hyperplane in n-dimensional space (which passes through the origin, like a projective (n - m - l)-hyperplane). 
The single real solution means that this hyperplane intersects the positive quadrant, so we can set initial rational 
values for each of the free variables {x m+ \, x m+ 2 , ..., x,,}- This forces every xf 1 to be positive and rational. We 
then apply the Sam Cappleman-Lynes technique to the equations, relying on the coprimality of various exponents. 
After all variables have been rationalised, we multiply out by a large power of the lowest common multiple of the 
denominators of the variables, turning them into integer solutions. 


Elliptic curves 


20. Let T be a non-singular cubic curve with integer coefficients. Suppose we have a line A which meets T at 
three points. A, B and C. Prove that if A and B are rational, then C is also rational. 

For example, the curve y 2 - x 3 - 4 x + 1 has rational points (0, 1) and (4, 7). We then draw the line through the 

3 

two points, namely y = - x + 1. This intersects the curve at a third point, which is the solution of the cubic 
equation x + l)~ = x 3 - 4 jc + 1, which simplifies to x 3 - ^ x 2 - 7 x = 0. We can factorise this, as we already 
know two of the roots, obtaining x(x - 4) [x + -) = 0. This gives us a third rational point on the elliptic curve, 
namely (-^, —j). 
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One of the great unsolved problems in mathematics, the Birch and Swinnerton-Dyer conjecture, is concerned with 
counting the number of integer points on an elliptic curve modulo p. Elliptic curves also featured prominently in 
Andrew Wiles’ proof of the Tanayama-Shimura conjecture and Fermat’s last theorem. Specifically, Wiles showed 
that every elliptic curve could be associated with a ‘modular form’, a complex function with the same hyperbolic 
symmetries as the Ford circles. Even the simplest, most elementary proof of Poncelet’s porism involves elliptic 
curves. 

We define a binary operation + on the points on the cubic curve, such that A + B + C = 0 for any three collinear 
points A, B, C e T. In other words, C = -(A + B). Let O = 0 be one of the points of inflection, so the line passing 
through C and O meets T again at C = A + B. It is trivially obvious that this operation is commutative, but 
proving associativity is a little more difficult. 

21. Let X, Y, Z be three non-collinear points on T. Show that (X + Y) + Z = X + (Y + Z), where addition is 
defined as in the last paragraph. [Associativity of elliptic curve operation] 

After proving associativity, parentheses can be omitted from expressions without ambiguity. For example, we can 
refer to the last expression simply as X + Y + Z. Addition of elements forms a group operation. We can then 
multiply by an integer n, by defining nX = X + X + ... + X. It is difficult to compute n from the points n X and 
X, so can form the basis of a cryptosystem similar to RSA using the group {T. +} instead of jz y , 9 , x). Elliptic 
curve cryptography is considered to be more secure than RSA. Again, it is susceptible to attacks from quantum 
computers. 

22. Let A, B, C be three collinear points on I . If A and B are both points of inflection, then show that C is also a 
point of inflection. 
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In the above diagram, a line is drawn through the three real points of inflection of a cubic curve. A general cubic 
curve on the complex projective plane has nine points of inflection lying on twelve lines in what is known as the 
Hesse configuration. It is a remarkable fact that this cannot be embedded in the real projective plane due to the 
Sylvester-Gallai theorem: if there is a finite set S of points such that no line contains exactly two points, then all 
points are collinear. Hence, a cubic curve has at most three real points of inflection. 
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Solutions 


1. Clearly, if A B has irrational gradient, then A is the only rational point on T. If A B has rational gradient, 
then we can express it as a linear equation in rational coefficients. By solving this simultaneously with the 
equation for T, we obtain a quadratic equation in rational coefficients for x (the abscissa). As one root of 
this (the coordinates of A) is rational, then the other root must also be. Repeating this process for y (the 
ordinate), it is evident that B e Q 2 . 

2. Inverting about a rational point A on the unit circle transforms the circle into a line by stereographic 
projection. All lines with rational slopes through A clearly correspond to the entire set of rational points on 
the real line. 


3. This is the three-dimensional analogue of the problem. We want to transform a point O A on the horizontal 
plane into a vector OB on the unit sphere by inverting about a sphere with centre C = (0, 0, 1) and radius 
V2~. We have that C A and C B are parallel, and the product of their lengths is 2. Hence, this gives us the 


(-, -, oj, where p, q and r are coprime, then we obtain 


formula CB = ~ CA . If we let A have coordinates | 

|CA | 2 

77T? 2 T“,’W t,, ■ ■ r," , 77T 2 (pq,rq,-q 2 ) ? — —+ ——* (2 p q,2r q,p 2 +r 2 -q 2 ) 

CB= * —. This simplifies to C B = —-—-—. Now, OB = OC + BC= --—— - - , giving us 


+i 


the complete set of solutions; jc = 
gcd(p, q,r)= 1 . 


2 pq 
i 2 +q 2 +r 2 


p^+q^+d 
2rq 


,y = - 


2 +q 2 +r 2 


p —q +r , 

, z = ———, where p, q, r 

p 2 +q 2 +m 


p 2 +q 2 +r 2 

E Z, q > 0 and 


4. a and b cannot both be odd; assume without loss of generality that a is even. Hence, b is odd as otherwise a, 
b and c would all be even. Using the previous question (and setting r = 0), the solutions to )” + (-)" = 1 

are - = 2 / g , and - = q . . If p and q are both odd then c is even, so one of p and q must be even. Hence, 

c p 2 +q 2 c p 2 +q 2 

p 1 - q 1 and p 2 + q 2 are coprime (by applying Euclid’s algorithm to obtain 2 p 2 and 2 q 2 , which only share 2 
as a common factor). This gives us the irreducible general solution; (a, b, c) = (2 p q, p 2 - q 2 , p 2 + q 2 ), 
where p and q are coprime integers. 


5. Let A B be the base of a semicircle with unit radius. A point C on the curved edge of the semicircle belongs 
to S if and only if both A C and B C are rational. (The previous question guarantees infinitely many choices 
of C.) If we have two such points, C\ and Ci, (such that A, C\, C 2 , B appear in that order) then 
Ci C 2 -AB = C\B-C 2 A-CiAC 2 B (by Ptolemy’s theorem), and thus Ci C 2 is rational. 


6 . For k- 2, the equation a 2 + b 1 = 2 a b + 2 simplifies to the (a - b) 2 = 2, which clearly has no integer 
solutions. For k = 1, a 2 + b 2 = 1 + a b is an ellipse. As a 2 + b 2 > 2 a b, then a b < 1. Also, a b > -1, since 
otherwise a 2 + b 2 would be negative. It is a simple matter to check all combinations of a, b e {— 1, 0, 1} and 
deduce that the solutions ( 0 , 1 ), ( 1 , 0 ), ( 0 , - 1 ), (- 1 , 0 ), ( 1 , 1 ) and (- 1 , - 1 ) are the only solutions. 


7. If one of a , b is negative and the other is positive, then ab < 0. Hence, a b < -1 and thus k(a b + 1) < 0. 
However, a 1 + b 2 = k(a b + 1) is strictly positive. 


8 . If we apply the substitution z = x + y,w = x - y then we obtain a hyperbola in standard position. Hence, the 
original curve is a hyperbola with centre ( 0 , 0 ) and an axis of symmetry y = x, which does not intersect the 
hyperbola on the real plane. 

9. As a 2 + b 2 = k(a b + 1), b is a root of the quadratic zr - kaz + (a 2 - l) = 0. By Vieta’s formulas, we have 

b + c = ka, and thus c = ka - b. This is obviously an integer point. As b > a, P is on the upper branch of the 
hyperbola, so Q must be on the lower branch and therefore c <b. 
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10. Assume that P = (a, b) is a positive integer solution and that b > a. We can generate a smaller solution in 
the non-negative integers by choosing the other intersection point Q = (a, c ) of the line x = a and curve T. 
By reflecting in the line y = x, we obtain a solution P' = (c, a), where a > c. This process will generate 
continually smaller solutions until one of the coordinates is zero. 


11. Let ° ' h = k. If k = 2, then there are trivially no solutions. Otherwise, if k > 3 and we have an integer 

a b +1 

solution (a, b), we can generate a solution where b = 0. So, there is a solution in the positive integers if and 

a 1 

only if there is a value k such that — = k, which only occurs when k is a perfect square. 


12 . 


Consider the equation a 2 + b 2 + 1 = ab k. This can be algebraically manipulated to (a - b) 1 + 1 = (k - 2) a b\ 
hence, it is obvious that k > 3. We now fix the value of k. As there are no solutions for a = 0 or b = 0, and 
every solution (a, b) yields alternative solutions (-a, -b ) and ( b , a), the curve must be a hyperbola with 
centre (0, 0) and diagonal lines of symmetry. Moreover, the hyperbola is contained entirely within the first 
and third quadrants. Using Vieta jumping, we can get from a solution (a, b) to (bk - a, b). If (a, b) lies to 
the right of the stationary point of the hyperbola, then this will generate a smaller (in terms of a + b) 
solution. We can then reflect in the line a = b and repeat the process until (a, b) lies to the left of this 
stationary point and below (or on) the line a = b. By considering the discriminant of a 2 - b k a + b 1 + 1 = 0, 


the stationary point can be located as 


. The only integer point lying to the left of this such 


, ^ k 2 -4 k 2 -4 

that a > b is (1, 1), which clearly is only a solution when k = 3. The graph below is the hyperbola for k = 5. 



-l 


2 



14. If we have two solutions (not necessarily distinct), (a, b ) and (c, d), then a 2 - n b 2 = 1 and c 2 - nd 2 - 1 . We 
can multiply them together and use Euler’s identity to obtain (ac + bd n ) 2 - n(ad + be ) 2 = 1, which is a 
strictly larger solution to the equation. Repeating the process from some initial solution ( b , a), we expand 
[a 2 - n b 2 f using Euler’s identity recursively to obtain infinitely many solutions, one for each k e Pi 
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15 . For every triangular number T, 8 T + 1 is square and the converse also holds. Hence, we want to solve the 
Pell equation x 2 = 8 y 2 + 1. A preliminary solution is that 36 is both square and triangular, or 
17 2 = 8-6 2 + l, from which we generate an infinitude. 


16 . Suppose we have a rational ^ , where a and b are coprime. Squaring it results in j; , where a 2 and b 1 are still 
coprime. Hence, squares of rationals (or sums of one square) have both a square numerator and denominator 

i i*i ns a c 2 e 2 c 2 f 2 +e 2 d 2 . •* * * r . 

when expressed in lowest terms. Suppose - = — + — = —;— is expressible as the sum of two rational 

squares. Then, it is expressible as a square of a rational multiplied by the sum of squares of two integers, 
therefore of the form N = t 2 2 k (p\ p 2 ... p„), where each p , is a prime congruent to 1 (modulo 4), k is an 
integer and t is a non-negative rational. For sums of three squares of rationals, we can express it as a square 
of a rational multiplied by the sum of squares of three integers. Hence, it is something that cannot be 


expressed as 


A k (8 m+7) 


where t is an odd integer, k is an integer and m is a non-negative integer. Any rational 


can be expressed as the sum of four squares of rationals. 


17 . We begin with a solution to a 2 + b 2 = c 2 , i.e. a Pythagorean triple. Multiplying by a 2 gives us 

a 4 + b 2 a 2 = c 2 a 2 . Then multiply by b 4 a 4 , giving us b 4 a 8 + b 6 a 6 = c 2 b 4 a 4 . Finally, multiply by c 48 b 96 a 96 , 
resulting in c 48 b 100 a 104 + c 48 b 102 a 102 = c 50 b 100 a 100 . This is clearly a solution to the equation x 4 + y 6 = z 10 . 


18 . If x = 1 (mod 4) and y = 2 (mod 4), then clearly x + y £ A. All numbers of the form 13 b 2 are in A, so if we 
can find infinitely many solutions satisfying x 13 + y 13 = 13 b 2 and the modulo-4 congruences then we are 
done. Start with a solution to w 13 + z 13 = 13 u, where w = 1 and z = 2 (mod 4). We note that u must be 
congruent to 1 (mod 4), as w 13 = 1 and z 13 = 0. We multiply by u 13 to give a solution 
(u w) 13 + (u z) 13 = 13 (« 7 ) 2 , without changing the congruence class of any of the variables. We can start with 
w = 13 + 52 k and z = 26, generating a solution for every k e As these solutions can become arbitrarily 
large, there must be infinitely many. 


19 . For each 1 < j < 2 2012 - 1, consider the Diophantine equation of the form £a, 2 = bj Pi , where a, 

(1 < i < 2012) is included in the sum if and only if the ith binary digit of j is 1 , and pj is the /th odd prime. 
We then apply the Sam Cappleman-Lynes technique to all equations simultaneously starting from the 
2 2011 - 1 equations of the form X+T = >'/• We initially set x, = i, and then multiply all equations by an 
appropriate power of y 2pi Pl Pj ~ l . This preserves the perfect-power property of all previous equations, and 
we can do this until the /th equation is also converted to the desired form. Repeating for all 2 2012 - 1 
equations gives us a solution to the original Diophantine equations. Then just take 
S = [a i 2 , a 2 2 , ..., fl 2 2 oi 2 _ 1 2 }. 


20 . The line passes through two rational points, so must have rational gradient. Hence, we can express y as a 
linear function of x with rational coefficients. Let a, b, c be the abscissae of A, B, C, respectively. The 
intersection of the cubic curve and the line is a cubic equation with rational coefficients and roots a, b, c. 
Due to Vieta's formulas, a + b + c is one of the rational coefficients. Hence, the rationality of a and b 
implies that of c. 


21 . Let <h be the union of the lines { Y, -(X + Y ), X},{-(Y + Z), 0,Y + Z) and {Z, X + Y, -((X + Y) + Z )}. 
Similarly, let ¥ be the union of the lines {T, -(T + Z), Zj, [-(X + Y), O, X + Y] and 
{X, Y + Z, -(X +(Y + Z))}. The three cubic curves L <J> and ¥ intersect in eight points, namely 
{X, T, Z, -(X + T), -(y + Z), X + Y, Y + Z, 0>, so must intersect in the ninth point by Cayley-Bacharach. 
Hence, -((X + Y) + Z) = -(X + (Y + Z)), and thus (X + Y) + Z = X + (Y + Z). 


22. The tangent lines passing through A and B do so with multiplicity 3, so we have 3 A = 3 B = 0. We then 
have 3 C = -3 (A + B) = 0, so C is also a point of inflection. 
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Conic sections 

Conics have recurred throughout this book in both geometric and algebraic settings. Hence, I have decided to 
dedicate the final chapter to them. As the only conics appearing on IMO geometry problems are invariably circles, 
the results proved in this chapter are largely irrelevant. Nevertheless, the material is sufficiently interesting to be 
worthy of inclusion. 


Sections of cones 


With the obvious exception of the circle, the conics were first discovered by the Greek mathematician Menaech- 
mus who contemplated slicing a right circular cone C with a flat plane A. Indeed, the term ‘conic’ is an abbrevia¬ 
tion of conic section. 



It is more natural to consider C as the double cone with equation x 2 + y 2 = z 2 . If A cuts both cones, then the conic 
section is a hyperbola. If it cuts only one cone in a closed curve, it is an ellipse. The intermediate case, where the 
plane is inclined at exactly the same slope as the cone, results in a parabola. 

Observe that x 2 + y 2 = z 2 is the equation of a projective circle; this explains why all conic sections are equivalent 
under projective transformations. 



We define a Dandelin sphere O to be a sphere tangent to both C (at a circle T) and A (at a point F, namely the 
focus). The plane containing T intersects A at a line 6, known as the directrix. 

1. Prove that the directrix is the polar of the focus. 
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For an arbitrary point P on the conic, we let PR meet T at Q. 

2. Prove that PQ = PF. 

3. Let A be the foot of the perpendicular from P to the plane containing F. Let D be the foot of the 

PQ 

perpendicular from P to <5. Show that — is independent of the location of P. 

By combining the two previous theorems, we establish the focus-directrix property of a conic section. 

P F 

■ For every point P on a conic section, the ratio — = s remains constant, e is known as the eccentricity of the conic. 

[Focus-directrix property] 

The type of conic section can be determined by its eccentricity. 

4. Show that e < 1 for an ellipse, e = 1 for a parabola and e > 1 for a hyperbola. 


Conics on a plane 


The focus-directrix property enables us to give conic sections a Cartesian treatment. By allowing the directrix to 
be the x-axis and scaling the conic so that the focus is at (0, 1), the equation of a conic becomes 
x 2 + (y - 1 ) 2 = £ 2 y 1 . We can see that a conic section is a quadratic curve (although this is obvious from the 
projective definition). More remarkably, the converse is also true: all non-degenerate quadratic curves are conic 
sections. 

5. Prove that, if e F 1, the conic has two lines of reflectional symmetry. 

Flence, for ellipses and hyperbolae, we can reflect the focus and directrix in the line of symmetry to obtain an 
alternative focus and directrix. Returning to the Dandelin spheres, the other focus corresponds to placing the 
sphere below the plane instead of above it. A parabola can be regarded as an ellipse/hyperbola with a focus on the 
line at infinity. 

The Cartesian equation also makes it evident that ellipses are indeed ‘squashed (affine transformed) circles’. We 
can translate the ellipse so that the lines of symmetry are coordinate axes, giving us the equation for an ellipse. 

x 2, y 2 y 2 

■ —r ; = 1 is the general equation for an ellipse. If we reverse the sign of , we obtain a hyperbola instead. 

[Cartesian equations for ellipses and hyperbolae] 



Consider an ellipse, with its two foci (F\ and Fi) and directrices. Let P be a variable point on the ellipse. 


6. Prove that P Fj + P F 2 is constant. 
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7. Show that the angles between the tangent at P and the lines P F\ and P Fn are equal. [Reflector property of 
the ellipse] 

A parabola can be considered to be the limit of ellipses with one focus fixed and the other tending towards 
infinity. This gives us the reflector property of the parabola, which states that a pencil of rays originating from the 
focus is reflected to a pencil of parallel lines perpendicular to the directrix. This was known to Archimedes, and 
formed the basis of a mechanism for igniting the sails of enemy ships by reflecting sunlight from polished metal 
shields. Nowadays, it is used in Newtonian telescopes for focusing light from infinity. 



Let the focus be the origin, and the directrix be represented with x = d in Cartesian coordinates. Consider the 
polar coordinates {r, 9) = (r cos 6, r sin 6). 

8. Prove that the equation for a conic in polar coordinates is r =--—, where l = ed. [Polar equation of a 

1+ecos 6 

conic] 

This parametrisation of a conic will prove useful when verifying Kepler’s laws of planetary motion in the next 
section. 

The length l = sd is known as the semi-lcitus rectum, as it is half of the length of the line segment parallel to the 
directrix passing through the focus and meeting the conic twice. The latus rectum is shown in black on the 
diagram of the parabolic reflector. 


The return of Steiner’s porism 


In Steiner’s porism, we considered the family of circles internally tangent to T 2 and externally tangent to Tj. Let’s 
suppose these two circles now intersect. Instead of a finite Steiner chain, we obtain an infinite set of circles 
bounded by T! and T 2 - 
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9. Show that the centres of the circles lie on an ellipse, the foci of which are the centres of T [ and r 2 . 

10. Hence demonstrate that the radius of the variable circle is proportional to the distance between its centre 
and the radical axis of T ( and r 2 . 

11. Three circular arcs, yi, y 2 and 73 , connect the points A and C. These arcs lie in the same half-plane defined 
by the line A C in such a way that 72 lies between and 73 . Three rays, hi, h 2 and h^, emanate from a point 
B on the line A C, resulting in a grid of four curvilinear quadrilaterals as shown in the diagram below. Prove 
that if one can inscribe a circle in each of three of the curvilinear quadrilaterals, then a circle can be 
inscribed in the fourth. [IMO 2010 shortlist, Question G7, Geza Kos] 



Geza created many more problems on this theme, all of which are amenable to embedding in three-dimensional 
space. One equivalent problem is where the situation is on the surface of a sphere, which can be transformed into 
the original problem through stereographic projection. Similarly, there are variants in hyperbolic space. 


Kepler’s laws of planetary motion 


The vast majority of the content of this book is exclusively in the realms of pure mathematics. Nevertheless, conic 
sections naturally occur as the paths traced by objects in gravitational fields. The elliptical orbits of planets were 
first proposed by Johannes Kepler, as a refinement of earlier (mostly Greek) ideas that celestial bodies travel in 
perfect circles. Isaac Newton later inferred his law of gravitation from Kepler’s laws; the derivation is not too 
difficult, although it relies heavily upon differential calculus. 

Consider an object of negligible mass moving around a fixed object O of large mass due to gravitational attrac¬ 
tion. An example of this is the Earth orbiting the Sun. We aim to show that the path must be a conic section, by 
showing that the polar equation of a conic satisfies Newton’s laws of gravitation. Suppose an object is initially at 
P and moves to Q (very close to P). Let A = [P O Q] be the area ‘swept out’ by the object, and consider the 
derivative —, known as the areal velocity. 

dt J 
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o 



12. If an object moves in a straight line at constant velocity, show that — is constant. 

dA 

13. If the acceleration of an object is entirely radial (towards or away from O ) at all times, then show that — 
again remains constant. [Conservation of angular momentum] 

Indeed, the converse is also true: if areal velocity is conserved, then acceleration is entirely radial. By integrating 

dA 

— with respect to time, we obtain Kepler’s second law. 

■ A planet P in orbit around the Sun O sweeps out equal areas in equal intervals of time. [Kepler’s second law] 

If P = (r, 8), then — = - r 1 — . So, the value of r 2 — must remain constant. Let’s refer to this value (twice the 

' ’ ' dt 2 dt dt v 

areal velocity) as k. 

In ordinary circular motion, the radial acceleration is given by -r(^ Hence, in the general case, radial accelera- 

.. . Sr rdd\2 

tion equals a = — - 

4 dA t dt > 

k 2 

14. Show that, if the planet P follows the path of a conic with focus O and semi-latus rectum /, then a = ——. 

r 2 l 

[Newton’s inverse square law] 

Conversely, if we assume the inverse square law a = - , then we can choose a conic with centre O, passing 

r 2 

k 2 

through P in the appropriate direction, and with a latus rectum of l =-. As Newton’s law of universal gravita- 

G M 

tion is deterministic, the conic must be the unique solution. Hence, the converse is also true: all planets obeying 
the inverse square law travel in conic orbits. If the orbit is cyclic (and therefore closed), then it must be an ellipse. 

■ A planet P describes an ellipse, one focus of which is the Sun O. [Kepler’s first law] 



By considering the equation r = 


--—, the value of r is minimised at — (the perihelion ) and maximised at — 

1+ecos6 1+e ^ 1 -s 
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(the aphelion). When the eccentricity is zero, the orbit is circular. 

In general, two bodies experiencing gravitational attraction will orbit each other in coplanar conic orbits, where 
the barycentre of the system (assumed to be stationary) is their common focus. For three or more bodies, the 
equations cannot be solved algebraically, and the system behaves chaotically (arbitrarily small initial perturba¬ 
tions lead to arbitrarily large effects). Indeed, it has been shown to be undecidable, so no computer or Turing 
machine is capable of calculating the movements with perfect precision. 


Areas of conics 


If we take the pole of the line at infinity, we obtain the centre of a conic. For parabolae, this point is at infinity, 
therefore does not lie on the affine plane. For ellipses and hyperbolae, however, the centre lies on the plane and 
can be taken as the origin. It is then possible to apply a rotation about the origin to place the conic in standard 
position. 




yp -p 

The ellipse has Cartesian equation — + — =1. The line y = xtan 8 meets the curve at P = (a cos 8, b sin 8). The 

area bounded by the curve, the line O P and the positive x-axis is given by ^ a b 8. In particular, when 8 = 2 n, the 
total area of the ellipse is nab. 

yp yp 

Similarly, the hyperbola has Cartesian equation — - — = 1. The line y = xtanh^ meets the curve at 
P = (a cosh (/>, b sinh </;). The area bounded by the curve, the line O P and the positive x-axis is given by ■ abcb. 
The hyperbolic functions are defined in a similar way to the trigonometric functions, with cosh tf> = ^ (e^ + e~ 
and sinh (t> = \ (e^ - e~ 



The area of a parabolic segment is much easier to calculate, as it can be obtained by integration of the equation of 

X 2 

the parabola, y = —, with respect to x. Archimedes instead used completely Euclidean methods in his Quadrature 
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of the Parabola , determining the area recursively by adding together the area of triangle ABC with the areas of 
the parabolic segments below A C and B C. M is the midpoint of A B, and C is the intersection of the parabola 
with the perpendicular from M to the directrix. The triangle constructed by repeating this process with A C 
instead of A B has one eighth of the area of the original triangle. By summing an infinite geometric series, the 
total area is equal to [A B C] (l + -jj + + jp; +...) = ^ [A B C]. 

Unlike areas, the arc lengths of ellipses are not easy to compute. The circumference of an ellipse is 4 a E(e), where 
E is the complete elliptic integral of the second kind. With the exception of the circle, where E( 0) = ^ and the 

circumference is 2 7 rr, E(s) cannot be expressed in terms of basic functions. 


Inversion in arbitrary conics 


In the chapter about the Riemann sphere, we considered inversion in a circle. It is, however, possible to invert 
about any conic section. We have a seven-parameter set of inversions we can apply, as the centre of inversion and 
conic can be chosen independently. 

■ Let T be a non-degenerate conic, and O be a point not on T. For any point P other than O , we draw the line / through 
O and P, and let it meet T at A and B. We then define P' to be the projective harmonic conjugate of P with respect to 

A B. [Inversion in a conic] 



Equivalently, we can define P' as the intersection of the polar of P with the line O P. 

To investigate the properties of conic inversion, apply a projective transformation to make T a circle and O its 
centre. Then, the basic theorems applying to ordinary inversion translate into projective versions. 

■ O inverts to an entire line ft, namely the polar of O, and vice-versa. (If O is the centre of the conic, then this is the line 
at infinity.) We treat this line as a single point, so the projective plane becomes topologically equivalent to a sphere. 

Allow ft to intersect T at the points / and J. 

In this perverse world of conic inversion, ft behaves like the point-line at infinity and I and J are analogous to the 
circular points. This enables us to convert theorems in circle inversion to their conic counterparts. 

■ Straight lines passing through O remain invariant under inversion. 


■ Conics containing O, I and J invert to straight lines not passing through O , and vice-versa. 
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■ Conics containing I and J (but not O ) invert to other conics containing I and J (but not O). 

In circle inversion, angles between curves remain constant (or, more precisely, are reversed). In conic inversion, 

this must be converted into a projective statement. 

15. If P and Q invert to P’ and Q', respectively, then show that P Q P' Q' 1J are conconic. 

16. Let curves C and D intersect at P. Let C" and D' be the inverse curves with respect to V. and let P' be the 
inverse of P. The tangent to C at P and the tangent to I )' at P ' intersect at R. Similarly, the tangent to l) at P 
and the tangent to C" at P' intersect at S. Show that P P' RS1J are conconic. [Preservation of generalised 
angle] 
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Solutions 


i. 


2 . 

3. 


4. 


5. 


6 . 


7. 


8 . 

9. 


10 . 


11 . 


Apply a projective transformation to take 8 to infinity, then apply an affine transformation to return PL to 
being a sphere. The cone is tangent to O, so remains a right circular cone. The plane containing T becomes 
parallel to the plane A (i.e. horizontal). By symmetry, F is now the centre of the conic (which is a circle), 
and therefore the pole of the line at infinity 8. 


They are both tangents from P to 1 \ therefore of equal length. 


The angle L A P Q in the right-angled triangle is constant (equal to half the angle at the vertex of the cone), 

PA P D 

so — is independent of the location of P. Similarly, —- is also constant, by considering the right-angled 
P Q PA 

triangle PAD. 


Observe that e = — = losLHPD When e = 1, the plane A is inclined at the same slope as the cone, thus 

PD cos LHPQ ’ P V 

creating a parabola. When e < 1 , the plane is shallower than this, so the conic is an ellipse. Conversely, 
when the eccentricity exceeds 1 , we have a hyperbola. 


The equation of the conic is x 2 + (l - £ 2 ) y 2 - 2 y + 1 = 0. We can complete the square, resulting in 
x 2 + (l - e 2 ) fy - -j—;) - Y~, • This is symmetric about the lines x = 0 and y = . 


Let the feet of the perpendiculars from P to the directrices be D\ and D 2 . We have 
PF l +PF 2 = s(PD 1 +PD 2 ) = s(D 1 D 2 ). 


Let Q be another point on the ellipse, very close to P. Let F\ P = u, F\ Q = u + 8 and F 2 Q = v. By the 
previous theorem, F 2 P = v + 8. Let P Q = y. We apply the cosine rule, to get 

cos lF\P Q- " +y o { " +S) = 7 S S . As Q approaches P, the y 2 and 8 2 terms become negligible, and this 
cosine equates to -- . This is the same as cos lF 2 Q P, so the rays Tj P and F 2 P describe equal angles with 
the normal to the curve. This establishes the reflector property. 

From the focus-directrix property, the equation for the conic is r = s(d - x) = £(d - r cos 9). Rearranging this 
equation gives us the formula for r. 


Let the variable circle have centre P and radius r, and let T, have radius R, and centre O,. We have 
P 0\ = r + R\ and P 0 2 = R 2 - r. Adding the equations gives P 0\ + P 0 2 = R\ + R 2 , which is constant. 
Hence, the locus of centres is an ellipse with foci 0\ and 0 2 . 


r = P 0\ - R\ is a linear function of the distance to the focus, which is a linear function of the distance to 
the directrix, which is a linear function of the distance to the radical axis (which is parallel to the directrix) 
As the radius of the variable circle tends to zero as it approaches the radical axis, this linear function must 
have a constant term of 0. Hence, the radius of the circle is proportional to the distance between its centre 
and the radical axis. 


Let the three circles be Cj, C 2 and C 2 , such that (without loss of generality) C 2 and Cj are tangent to both h \ 
and h 2 , and Cj and C 2 are tangent to both y t and y 2 . Let a fourth circle Cj be tangent to y 2 and y 2 , and let its 
centre lie on the line B Cj. Let c/,- denote the perpendicular distance between the centre of C,- and the line 

r d 

A C, and let r, denote its radius. We want to show that — = —, as this means that the circles are homothetic 
with centre of homothety B, so C 4 is tangent to both h 2 and h 2 . Using this idea of homothety, we have 
— = -y. Similarly, the previous exercise gives us — = and — = -y. We can multiply these three equations 

r 2 d 2 r 3 d 3 r x d x 
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r d 

to prove that — = - 7 , so C 4 is indeed inscribed in the remaining curvilinear quadrilateral. For further 

r l "1 

discussion of the problem, see http://www.imo-official.org/problems/IMO2010SL.pdf. This includes a three- 
dimensional interpretation of the problem where cones of equal gradient are erected on the circles. 

12. Let d be the perpendicular distance from the locus of motion to 0. Then [ 0 P Q] = ^ v dt, so ^ v d is 
constant. 


13. If P continues in a straight line at its present velocity, let the new position after time t be denoted Q. If P 
instead is accelerated towards or away from 0, then its new position is denoted Q'. As the acceleration is in 
the direction of 0, and t is very small (technically, the limit as t -> 0), 0 P is parallel to 00'. So, 

dA 

|O Q P] = [0 Q' P] and thus — is unaffected by the acceleration. So, it must remain constant, as in the 
previous scenario. 


14. We begin with the polar form of a conic, r = 


1+e cos B 


. We then rearrange to obtain - = 1 + e cos 8, and 


differentiate both sides with respect to time. This gives us - = —e sin 8 . Multiplying both sides by 

dr 


(fir k e d6 /I 1 \ 

— gives us — = — sin 8. Proceeding to differentiate again results in — = — cos 8— =k\ -) —, 

l ° dt l ° ° dt 2 l dt \r l' dt 

where the last stage involves substituting the equation of the conic back into the equation. Acceleration is 
then a~^~ - -1 — - d—\ 2 - > 


-A—V = k(- - 1 )- =-— 

' \ r 0 dt \dt> L Ur 2 r 3 r 2 /' 


15. Project / and J to the circular points at infinity, so 0 is the centre of the circle T. P Q P' Q' are concyclic, 
thus P QP' Q' IJ are conconic. 

16. Again, project I and J to the circular points at infinity, so 0 is the centre of the circle T. Then, this 
statement equates to P P' R S being a cyclic quadrilateral, which is obvious from the fact that angles are 
preserved in circle inversion. 
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Glossary 

When writing the book, I have assumed you are familiar with terminology such as ‘orthocentre’ and ‘geometric 

mean’. As this may not necessarily be the case, some common terms are explained here. 

• abscissa: the .r-coordinate of a point on the plane. Compare with ordinate. 

• altitude: a line from a vertex of a triangle, which is perpendicular to the opposite side. The three altitudes 
intersect at the orthocentre. 

• AM-GM inequality: for n non-negative real numbers, the arithmetic mean is greater than or equal to the 
geometric mean, with equality if and only if all variables are equal. 

• Apollonius’ theorem: in a triangle ABC, where M is the centre of B C, we have A M 2 = -^ f r + \ c 1 - ^ a 2 . 

• areal coordinates: a system of projective homogeneous coordinates where each point is considered to be the 
weighted barycentre of three variable masses, each of which is positioned at a vertex of a fixed ‘reference 
triangle’. 

• Argand plane: the idea of representing the real and imaginary parts of a complex number as the Cartesian 
coordinates of a point on the Euclidean plane. 

• arithmetic mean: for n variables [x\, ..., x„), the arithmetic mean is - (xi + ... + x„). 

n 

• barycentre: the centre of mass of a set of masses positioned at points (on the plane). 

• barycentric coordinates: a synonym of areal coordinates. 

• Bezout’s theorem: two algebraic curves of degrees m and n intersect in precisely m n points on the complex 
projective plane, when counted with the appropriate multiplicity. 

• Brahmagupta’s formula: for a cyclic quadrilateral with side lengths a, b, c, d and semiperimeter s, the area is 
given by V (s - a) (s - b) (5 - c) (.? - d) . This is a generalisation of Heron’s formula. 

• Brianchon’s theorem: if a hexagon ABCDEF is circumscribed about a circle (or, more generally, a conic), 
its three major diagonals (AD, BE and C F) are concurrent. 

• Cardano’s formula: the general solution to a cubic equation. 

• Catalan sequence: a sequence of integers that counts the number of valid strings of 2 n parentheses. 

• Cauchy-Schwarz inequality: for two vectors u and v, u ■ v < \ u \ \ v |, with equality if and only if u and v have 
the same direction. 

• Cayley-Bacharaeh theorem: if two cubics intersect in nine points and a third cubic passes through eight of 
those points, then it also passes through the ninth. 

• Cayley-Menger determinant: a formula for the square of the volume of a simplex in terms of the squares of 
the side lengths. 

• centroid: the intersection of the three medians of a triangle. More generally, it is synonymous with barycentre. 

• Ceva’s theorem: if D, E and F are points on the (possibly extended) sides BC,C A and A B, respectively, then 
AD, B E and C F are concurrent if and only if zz; • ziz • zz = 1 • 

DC EA FB 

• circular points at infinity: a pair of points on the complex projective plane through which all circles pass. 

• circumcentre: the point O equidistant from the three vertices of a triangle. 

• circumradius: the radius R of the circumscribing circle of a triangle or cyclic polygon. 

• collinear: points lying on the same straight line. 

• concentric: objects sharing the same centre. This is usually applied to circles, but is equally applicable to 
conics. 

• conconic: points lying on the same conic section. 
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concurrent: three (or more) lines are said to be concurrent if they all intersect at a single point or are all 
mutually parallel. 

coneyclic: points lying on the same circle. 

conic: a curve in the plane described by a quadratic equation in Cartesian coordinates, 
coplanar: points (or curves) lying on the same flat plane, 
coprime: two integers with a greatest common divisor of 1 . 
cosine rule: for a generic triangle A B C, a 2 = b 2 + c 2 - 2 b c cos A. 

cross-ratio: for four collinear points, the ratio AB CD . If the cross-ratio is -1, the points form a harmonic 

bcdA 

range. 

cube roots of unity: the three roots of the polynomial z 3 - 1. We often use to to represent the ‘north-west’ 
complex cube root of unity i- \ . 

Desargues’ theorem: two triangles are in perspective about a point if and only if they are in perspective about 
a line. 

difference of two squares: the polynomial a 2 - b 2 = (a - b) (a + b). 

difference of three cubes: the polynomial a?+b 2 + C 2 - 2>abc = (a+ b + c) (a+ bco + c or) (a + b to 1 + c co), 
where to is a primitive cube root of unity. 

Euclid’s algorithm: the greatest common divisor of a and b can be obtained by subtracting the smaller from 
the larger and repeating until one of the numbers is zero. For example, 

(26, 10) —> (16, 10) -»( 6 , 10) -» ( 6 , 4) -> (2, 4) -> (2, 2) -> (2, 0), so the greatest common divisor of 26 and 10 is 
2 . 


Euler-Apollonius lollipop: the disc on diameter GH, which contains the incentre , symmedian point and 
Gergonne point. 

Euler-Fermat theorem: if a and n are coprime, then a' ftn] = 1 (mod «), where <p(n) is the number of positive 
integers < n which are coprime to n. 

Euler line: the circumcentre, centroid, nine-point centre and orthocentre are collinear in the ratio 
OG.GT \T H = 2:1:3. 

Euler’s inequality: O I 2 = R 2 - 2 R r, where the circumcircle has centre O and radius R, and the incircle has 
centre I and radius r. 

excentre: the centre of an excircle. 


excircle: one of three circles (other than the incircle) tangent to (the extensions of) the three sides of a triangle. 

Feuerbach’s theorem: the nine-point circle is tangent to the incircle and three excircles. 

Fibonacci sequence: the sequence defined with Fq = 0, Fj = 1 and F n+2 = F „, ( + F„. If you extrapolate it 
backwards, you obtain the ‘nega-Fibonacci numbers’. 

fundamental theorem of algebra: a degree-c/ polynomial can be factorised into d linear factors over the 
complex numbers. 

fundamental theorem of arithmetic: every positive integer has a unique prime factorisation, 
geometric mean: for n variables {jti, ..., x n }, the geometric mean is V x i x 2 ... x„ . 

Gergonne point: the intersection of the lines joining each vertex of a triangle to the point of tangency of the 
incircle with the opposite side. 

glide-reflection: the composition of a reflection in a line and a translation parallel to the line, 
harmonic mean: for n variables {x\ , ..., x„], the harmonic mean is —— p 2 --. 

*1 x 2 x n 

harmonic quadrilateral: a cyclic quadrilateral where the products of opposite side lengths are equal. 
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• harmonic range: a set of collinear points with a cross-ratio of -1. 

• Heron’s formula: if a triangle has side lengths a, b, c and semiperimeter s, the area is given by 
V s(s — a)(s — b) (s - c) . It is a special case of Brahmagupta’s formula. 

• heterochromatic: differently-coloured. 

• homothety: a synonym of enlargement, homothecy, scaling, dilation or dilatation. 

• incentre: the centre of the incircle of a triangle (or, more generally, inscribable polygon). 

• incircle: the circle tangent to the three sides of a triangle and contained within it. 

• inradius: the radius r of the incircle. 


• intersecting chords theorem: if there is a point P in the plane of a circle T, and a line / passing through P and 
meeting T at A and B, then the value of P A P B is independent of 1 and equal to the power of the point P. 

• median: a straight line joining a vertex of a triangle to the midpoint of its opposite side. 

• Menelaus’ theorem: if If E and F are points on the (possibly extended) sides B C, C A and A B, respectively, 

then D, E and F are collinear if and only if • rzi • = -1. 

DC EA FB 

• monic polynomial: a polynomial of degree n where the coefficient of x" is 1. Every polynomial is a scalar 
multiple of a monic polynomial. 

• monochromatic: everything is the same colour. 

• Nagel point: the intersection of the lines joining each vertex of a triangle to the point of tangency of the 
opposite excircle with its corresponding side. 

• nine-point circle: the circle passing through the midpoints of the sides, the feet of the altitudes and the 
midpoints of AH, B El and C H, where H is the orthocentre. 

• nth roots of unity: the n roots of the complex polynomial z n - 1. If it cannot be expressed as a /nth root of 
unity for some m < n, then it is known as ‘primitive’. The monic polynomial whose roots are the </>(«) primitive 
nth roots of unity is known as a ‘cyclotomic polynomial’. 

• ordinate: the y-coordinate of a point on the plane. Compare with abscissa. 

• orthocentre: the intersection point El of the three altitudes of a triangle. 

• Pappus’ theorem: the special case of Pascal’s theorem when the conic is a pair of straight lines. 

• parallelepiped: a three-dimensional version of a parallelogram, obtained by applying a generic affine 
transformation to a cube. The n-dimensional generalisation is called a parallelotope. 

• Pascal’s theorem: if a hexagon is inscribed in a circle (or, more generally, a conic), the three pairwise 
intersections of opposite sides are collinear. 

• power of a point: for a point P in the plane of a circle with centre O and radius R, the value of OP 1 - R 2 is 
known as its ‘power’. 

• projective plane: an extension of the Euclidean plane where parallel lines are considered to meet on a line at 
infinity. 

• Ptolemy’s inequality: if A, B,C and D are four points in space, the inequality ABCD + BCDA>ACBD 
holds, with equality if and only if A B C D is a (non-self-intersecting) cyclic quadrilateral. 

• Pythagoras’ theorem: for a right-angled triangle ABC, where C-y, the identity a 2 + b 2 = c 2 applies. It is a 
special case of the cosine rule. 


• quadratic mean (RMS): for n variables {x \, ..., x„j, the quadratic mean is 



... + x,, 2 ) . 


• radical axis: the locus of points of equal power with respect to two circles Ti and T 2 . This is necessarily a 
straight line. 

• semiperimeter: half of the perimeter of a polygon. 
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• semiprime: the product of two distinct primes, e.g. 23 x 89 = 2047. 

• sine rule: For every triangle ABC, = —— = — 2 — - 2R, where R is the circumradius. 

sin A sin B sin C 

• Stewart’s theorem: If D is a point on the line B C, then BDDCBC + AD 2 BC = AC 2 -BD + AB 2 -DC. 

• symmedian: the reflection of a median of a triangle in the corresponding interior angle bisector. 

• symmedian point: the intersection of the three symmedians of a triangle. It has unnormalised areal coordinates 

(a 2 , c 2 ). 

• triangle inequality: each side of a triangle is smaller than the sum of the other two sides. In terms of vectors, 
this is \a_+h\ < | a | + | fc|. 
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Further reading 

If you enjoyed this book, which I hope you did, then you may find these other books, papers and websites of 
interest. 


Books 


The IMO Compendium, by Djukic, Jankovic, Mafic and Petrovic: This book is divided into a list of useful 
theorems, a massive repository of shortlisted IMO problems, and solutions to the aforementioned problems. If 
you’re training for international competitions, this will give you surplus experience. 

A Mathematical Olympiad Primer, by Geoff Smith: If you have no past experience of mathematical olympiads, 
this is the place to begin. 

Plane Euclidean Geometry, by Bradley and Gardiner: This is a rigorous exploration of Euclidean geometry, 
including more basic techniques such as angle chasing and similar triangles in addition to vectors, Cartesian 
coordinates and trigonometry. 

The Elements, by Euclid: This constructive approach to geometry begins with five postulates from which 
everything else is proved. It is possible to define ‘non-Euclidean geometries’, such as spherical and hyperbolic 
geometry, by rejecting a subset of these postulates. 

Introduction to Number Theory and Inequalities, by Christopher Bradley: Affectionately known as ‘INTI’, 
this book covers these topics in extensive detail. At the expense of losing a catchy acronym, it has since been 
separated into two disjoint books, unsurprisingly called ‘Introduction to Inequalities’ and ‘Introduction to 
Number Theory ’. 

The Symmetries of Things, by Conway, Burgiel and Goodman-Strauss: In addition to the familiar Platonic 
solids and regular tilings, there is a cornucopia of objects with fascinating symmetry. This book features a 
systematic exploration of different symmetry groups in two, three and four dimensions. 

Ait Introduction to Diophantine Equations, by Titu Andreescu and Dorin Andrica: The title of the book is 
rather self-explanatory. 

Complex Numbers from A to... Z, also by Andreescu and Andrica: This features an exploration of the Argand 
plane, including a substantial amount of triangle geometry. 

The Algebra of Geometry, by Christopher Bradley: Coordinate methods, such as Cartesian, areal and projective 
coordinates, can be employed to solve geometry problems with varying degrees of success. If you want to learn 
more about them, this book is ideal. 


Online resources 


Complex Projective 4-Space (http://cp4space.wordpress.com): My website, which is updated periodically with 
mathematical miscellany. 

Wolfram MathWorld (http://mathworld.wolfram.com): This is an online encyclopedia containing definitions and 
information about practically every mathematical concept discovered. 

Areal Co-ordinate Methods in Euclidean Geometry, by Tom Lovering 

(http://www.bmoc.maths.org/home/areals.pdf): This is more succinct than Bradley’s book, but definitely worth 
reading. 

Curves in cages: an algebro-geometric zoo, by Gabriel Katz (http://arxiv.org/abs/math/0508076): A special 
case of the generalised Cayley-Bacharach occurs when two of the curves are unions of lines. This leads to some 
interesting generalisations of Pascal’s theorem, complete with proofs. 
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Online Encyclopedia of Integer Sequences, by N. J. A. Sloane (http://oeis.org/): A continually expanding 
collection of over 200000 sequences of integers. Practically every conceivable integer sequence (well, 200000 
out of 2 n ° ) is featured somewhere within this vast repository. 

Encyclopedia of Triangle Centres, by Clark Kimberling 

(http://faculty.evansville.edu/ck6/encyclopedia/ETC.html): If you close this book and gaze at the front cover, you 
will see a configuration of triangle centres. In reality, there are uncountably infinitely many possible triangle 
centres, over five thousand of which are featured on this website. Moreover, there is a method of searching for 
triangle centres based on trilinear coordinates. 

Virtual Geoff Smith (http://people.bath.ac.uk/masgcs/): This features some papers by Geoff and Bradley, includ¬ 
ing the combinatorics of snail venom. Exciting! 

UK IMO Register, by Joseph Myers (http://imo-register.org.uk/): A hall of fame of everyone who has repre¬ 
sented this sceptred isle in any of the five main mathematical olympiads (RMM, IMO, BalkMO, EGMO and 
CGMO). Also, you can browse reports of competitions, such as Richard Freeland’s excellent and witty report of 
RMM 2011. 

The UVW method, by Tejs (http://ohkawa.cc.it-hiroshima.ac.jp/AoPS.pdf/The%20 uvw %20 method.pdf): The 
first paper describing the use of the uvw method of solving trivariate symmetric polynomial inequalities. 

What is... a Dimer, by Richard Kenyon and Andrei Okounkov (http://www.ams.org/notices/200503/what- 
is.pdf): We touched upon bipartite matchings and domino tilings in the first chapter, but this is a more detailed 
analysis. 


Discussion forums 


Ask nRich (http://nrich.maths.org/discus/messages/board-topics.html): A friendly atmosphere where everyone is 
nice and polite towards each other, helping to solve problems in an idyllically co-operative way. 

MathLinks (http://mathlinks.ro): A fierce battleground where unwitting bypassers are sadistically subjected to 
supposedly ‘trivial’ problems. The atmosphere differs greatly from nRich, being described as ‘the difference 
between carnivores and herbivores’. 
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